A woman holds her hand in ice [...] pain tolerance. [...] hands for up to 5 minutes. However,
for some of those taking opioids, 10 to 15 seconds is the limit, suggesting
opioid-induced hypersensitivity to [...]

EDITORIAL


Forecasting the opioid epidemic 


Since 2000, almost half a million Americans have 
died from drug overdoses. This modern plague— 
largely driven by opioid addiction—degrades 
health, saps productivity, spawns crime, and 
devastates families, all at enormous societal cost. 
How did we get here, and what do we do now? 
About 20 years ago, compassionate advocacy 
for better treatment of chronic pain, combined with ag- 
gressive marketing of high- 
dose opioid formulations, 
led to a sharp increase in the 
prescribing of legal opioids 
by physicians in the United 
States. An unintended conse- 
quence of this well-meaning 
movement was that millions 
of Americans became depen- 
dent on opioids. Drug cartels 
seized the opportunity to sell 
heroin as a cheaper alterna- 
tive to this ready-made con- 
sumer base. As users switched 
from high-quality pharmaceu- 
ticals to street drugs of unreli- 
able composition and quality, 
deaths mounted. Mortality 
rates surged further as potent 
illicit synthetic drugs—such 
as fentanyl—were mixed in 
with heroin. 

Law enforcement officials 
have realized that “we can’t 
arrest our way out of the epi- 
demic” and that a new, public health-oriented approach 
is needed. How can a new approach be designed? Bet- 
ter understanding of the underlying system dynamics 
may prove critical. Computational models are routinely 
used by engineers to understand the dynamics of physi- 
cal systems, and models and simulations are increasingly 
being used in public health to understand the dynamics 
of contagious disease epidemics in human populations. 
Today, every major public health agency has access to a 
modeling unit that provides policy decision support for 
epidemics such as HIV/AIDS, influenza, Ebola, and Zika. 

The opioid epidemic can be approached as a dynami- 
cal system, composed of networks of interacting in- 
dividuals including nonusers, users, legal prescribers, 
illicit suppliers, treatment providers, supporters (family 
and friends), law enforcement officials, and others. Data 
on the frequency of opioid drug prescribing, illicit drug 
availability, the natural history of substance use disorder, 
treatment effectiveness, availability of support services, 
and other key factors can then be used to construct and 
parameterize models. Borrowing from methods used to 
model contagious diseases, these dynamical models can 
be employed to forecast the likely future trajectory of 
the opioid epidemic in the population (not a single path, 
but a range of possible paths as bounded by uncertainty) 
and then to evaluate—in silico—interventions designed 
to alter the trajectory. Pol- 
icy-makers could estimate 
the likely effectiveness (and
cost-effectiveness) of
possible public health inter- 
ventions such as increasing 
the availability of naloxone 
(medication used to block the 
effects of opioids), decreas- 
ing prescriptions, expanding 
treatment, and widening ed- 
ucation, as well as concurrent 
deployment of two or more of 
these interventions. Compu- 
tational modeling can also 
be used to explore synergies 
between public health and 
law enforcement. 
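
The forecasting idea described above can be sketched with a toy compartmental model of the kind used for contagious diseases. This is a minimal illustration only: the compartments (nonusers, users, dependent, treated), every rate, the parameter ranges, and the population size are hypothetical placeholders, not estimates from data and not a model proposed in this editorial. It shows how sampling uncertain parameters gives a range of possible paths, and how an intervention (here, a higher treatment rate) can be compared in silico.

```python
import random


def simulate(years, rx_rate, dependence_rate, treatment_rate,
             relapse_rate, overdose_rate, population=1_000_000.0):
    """Yearly-step compartmental sketch; all rates are hypothetical."""
    nonusers, users, dependent, treated = population, 0.0, 0.0, 0.0
    deaths_per_year = []
    for _ in range(years):
        # Flows between compartments during one year.
        new_users = rx_rate * nonusers
        new_dependent = dependence_rate * users
        entering_treatment = treatment_rate * dependent
        relapsing = relapse_rate * treated
        deaths = overdose_rate * dependent
        # Update each compartment with its inflows and outflows.
        nonusers -= new_users
        users += new_users - new_dependent
        dependent += new_dependent + relapsing - entering_treatment - deaths
        treated += entering_treatment - relapsing
        deaths_per_year.append(deaths)
    return deaths_per_year


def forecast_range(n_runs, years, treatment_rate, seed=0):
    """Return the 5th and 95th percentile of cumulative deaths across runs."""
    rng = random.Random(seed)
    totals = []
    for _ in range(n_runs):
        # Uncertain parameters are drawn from invented illustrative ranges.
        deaths = simulate(
            years,
            rx_rate=rng.uniform(0.01, 0.03),
            dependence_rate=rng.uniform(0.05, 0.15),
            treatment_rate=treatment_rate,
            relapse_rate=rng.uniform(0.2, 0.4),
            overdose_rate=rng.uniform(0.005, 0.02),
        )
        totals.append(sum(deaths))
    totals.sort()
    low = totals[int(0.05 * (n_runs - 1))]
    high = totals[int(0.95 * (n_runs - 1))]
    return low, high


# Compare a baseline with a hypothetical "expanded treatment" scenario.
baseline = forecast_range(200, 10, treatment_rate=0.10)
expanded = forecast_range(200, 10, treatment_rate=0.30)
print("baseline range:", baseline)
print("expanded treatment range:", expanded)
```

A real model would need compartments, rates, and network structure fitted to the kinds of data discussed in the next paragraph; the sketch only shows the shape of the calculation.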

Increased data openness 
will be needed. Each country 
will have its own data access 
issues. In the United States, 
overdose death counts are available 
from public death records, 
but these are compiled slowly, 
and are not readily available at detailed geographic speci- 
ficity. Prescription data can be obtained from commercial 
vendors and from state monitoring programs, but these 
are either not publicly available or are available only at 
exorbitant cost. Illicit drug seizure data are collected, but 
these are not readily shared outside law enforcement. 
Self-reported drug use patterns are available from na- 
tional surveys, but these do not provide regional or local 
data. Urine drug testing data are collected, but are held as 
proprietary. Finding creative ways to open up and analyze 
these and other data sources can lead to valuable insights 
into the dynamics of the opioid epidemic. 

The opioid epidemic is a complex, dynamical process, 
and it should be approached as such in the development 
and evaluation of policy. A coordinated national opioid 
epidemic modeling program could help solve this diffi- 
cult problem. 


-Donald S. Burke 
@SamDillenback, tweeting about the elaborate entries in the annual 
NASA-Jet Propulsion Laboratory pumpkin carving competition. 


Nations agree to create world’s largest marine reserve in Antarctica 


A key polar habitat will be getting some extra 
protection. After years of negotiations, nations 
agreed on 28 October to establish the world’s 
largest marine reserve in the Ross Sea off 
Antarctica. The deal bans commercial fishing 
in the 1.55-million-square-kilometer swath of 
international waters, but allows research in 28% of the 
preserve. It calls for the reserve to open on 1 December 
2017 and exist for at least 35 years. The pact came after 
Russia, China, and Ukraine dropped their objections 
at a meeting of the Commission for the Conservation 
of Antarctic Marine Living Resources (which includes 
24 countries and the European Union) in Hobart, 
Australia. Researchers say the reserve, which borders 
a major ice shelf, includes two biologically rich 
seamounts and is home to an estimated one-half of the 
world’s South Pacific Weddell seals, one-third of Adélie 
penguins and Antarctic petrels, and 25% of emperor 
penguins. The Wildlife Conservation Society in New York 
City welcomed the move to protect “this unique 
seascape,” adding that “the world is coming to grips with 
the need to conserve large swaths of our oceans.” 


Cargo ships to report emissions 


LONDON | In a step toward quantifying and 
reducing the climate impact of the shipping 
industry, the International Maritime 
Organization (IMO) announced last week it 
will require large cargo ships to record and 
report their fuel consumption. Countries 
will provide aggregated annual data on 
their fleets’ fuel use to a database starting in 
2018. IMO’s Marine Environment Protection 
Committee will analyze the data to develop 
a strategy to cut greenhouse gas emissions 
from ships. One option is tightening energy 
efficiency standards for new ships, which 
IMO set in 2011. The committee also cut the 
amount of sulfur allowed in fuel oil, which 
contributes to air pollution, by 85% starting 
in 2020. It remains much higher than the 
limit for diesel fuel used by cars and trucks. 


U.S. science test shows gains 
WASHINGTON, D.C. | Students in grades 
four and eight did slightly better in science 
than their cohorts in 2009 on a test 
dubbed the Nation’s Report Card. Results 
released last week from the 2015 National 
Assessment of Educational Progress in 
science also showed that the large achievement 
gap between white and minority 
students (a difference of roughly 35 points 
for blacks and 25 points for Hispanics 
on a scale of 1 to 300) narrowed slightly. 
In contrast, scores for the nation’s 12th 
graders didn’t budge, nor did the yawn- 
ing racial disparity shrink. The results for 
younger students are also broken down by 
state, with Tennessee showing the largest 
increase. At the same time, only one-third 
of students earned a proficient score, and 
one-third scored below the basic level. 


A backup site for the TMT 


LA PALMA, IN SPAIN | The location of the 
Thirty Meter Telescope (TMT) remains 
in the balance as hearings continue 
this month in Hawaii over its disputed 
building permit. This week, the board 
of the TMT International Observatory 
announced its backup site: the Roque 
de los Muchachos on La Palma, one of 
Spain’s Canary Islands. Roque de los 
Muchachos is at an elevation of only 2400 
meters, compared with Mauna Kea’s 4050 
meters, which would make infrared 
observations more challenging: Atmospheric 
moisture absorbs infrared, and the light 
will travel through more atmosphere. But 
La Palma is also an established 
observatory with existing infrastructure. Native 
Hawaiian activists object to siting the 
TMT on Mauna Kea, which is considered 
sacred by the Hawaiian religion, and have 
had the TMT’s building permit overturned 
by Hawaii’s supreme court on a technical 
issue. The TMT will be one of the world’s 
largest telescopes when it begins 
operating next decade. 


Magnitude of an earthquake that 
struck 100 kilometers northeast 
of Rome on 30 October, Italy’s 
strongest quake since 1980. 


Amount fronted by the Bill & 
Melinda Gates Foundation and 
the Wellcome Trust to expand 
the release of bacteria-infected 
mosquitoes in South America to 
fight the spread of Zika. 


Fraction of wild animal populations 
lost between 1970 and 2012, the 
Zoological Society of London and 
the World Wildlife Fund report. 


Turkish academics get ax 


ANKARA | In a surprise decree over the 
weekend, Turkey’s government announced 
the firing of more than 1000 academics 
at public universities in the wake of a 
failed coup attempt in July. Many of those 
fired are learning of their fate by finding 
themselves on a long list of names that 
the government released. The government 
also announced that it will now directly 
appoint the leaders of Turkey’s universities. 
Insiders say that the government has long 
held indirect control over the election of 
university rectors, and that the new decree 
simply makes the government’s control of 
academic leadership official. 


NEWSMAKERS 


Three Qs 


As a graduate student at the University 
of Cambridge in the United Kingdom 
studying heart valve bioengineering, 
Jacob Brubert spent last summer cavorting 
in public: dancing with colleagues as 
polymer molecules, simulating a beating 
heart valve, even juggling. For his creative 
moves and painful honesty (Brubert ends 
his dance in despair as his experiment 
goes haywire) he earned top prize in the 
annual Science/AAAS Dance Your Ph.D. 
contest: $1000 and a trip to Boston for 
the 2017 AAAS Annual Meeting. Science 
caught up with Brubert, now in medical 
school at the University of Oxford in the 
United Kingdom. 


Q: What motivated your research? 
A: About 2% of people will have a heart 
valve problem in their lifetime, and about 
300,000 artificial valves get implanted 
each year. The mechanical ones last, but 
... you have to take anticlotting drugs that 
have painful and dangerous side effects. 
Animal tissue valves don’t have this problem, 
but don’t last as long. We’re trying to 
make polymer valves that last long and are 
gentle on the blood. 


Q: Did your Ph.D. experiments really fail? 
A: I had a great Ph.D. but ... not everything 
worked. The crucial thing we haven’t figured 
out is how to make the valve durable. 
It can beat for 100 million cycles before 
failing, but that’s only 2 years of human 
life. The natural valve lasts for 3 
billion beats. 


Q: Has the contest changed your life? 
A: Explaining my Ph.D. to friends, family, 
and strangers on the bus is far more 
amusing now. [But] I now get asked to 
translate every presentation I do into 
interpretive dance! 


Striped mouse reveals clue to mammalian color patterns 


Everyone wants to know how the zebra got its stripes, but zebras are hard to study 
in the lab. So Harvard University evolutionary biologist Hopi Hoekstra and her 
colleagues turned to the African striped mouse (Rhabdomys pumilio), which lives in 
southern Africa and has alternating dark and light stripes running down its back. 
When they studied skin development in the mouse embryos, they discovered that 
a gene called Alx3 was very active where light stripes form; its protein suppresses the 
formation of dark pigment, Hoekstra and her colleagues report this week in Nature. The 
same proved true where eastern chipmunks develop stripes, and because chipmunks 
and these mice are separated by 70 million years of evolution, Hoekstra and her 
colleagues think this gene may lead to stripes and other distinctive color patterns across 
the mammals, including zebras. A preliminary study found the Alx3 gene was more 
active in the white parts of zebra skin than in the black parts. “But we will still need to do 
additional work to make any real case,” she cautions. http://scim.ag/Rodentstripes 

Researchers found a striping gene in this African striped mouse. 


Your entire genome in 3D 


We used to think of our 1.8-meter-long genomes as linear molecules of DNA somewhat 
randomly squished into the nucleus. But over this past decade, researchers have 
come to realize very deliberate loops bring specific parts of each chromosome into 
contact to help control what genes are active. Now, biophysicist Marco Di Stefano at the 
National Centre for Genomic Analysis in Barcelona, Spain, and his colleagues have 
used sophisticated statistical approaches to convert experimental data that provided indirect 
information about individual connections into a comprehensive, biologically correct 3D model. 
It shows how all our chromosomes (different colors) pack together in the nucleus, the team 
reported in Scientific Reports. The model is “summarizing a large portion of the knowledge 
we have on the DNA organization in the nucleus,” says Di Stefano, who did the work while a 
graduate student at the International School for Advanced Studies in Trieste, Italy. “It is indeed 
beautiful,” says Job Dekker, a biologist at the University of Massachusetts Medical School in 
Worcester who pioneered methods to determine DNA contact points. 

Colored chromosomes show how our DNA packs together in the nucleus. 


FINDINGS 
Ebola mutation boosts infection 


Two separate prominent international 
groups of researchers have found evidence 
that the Ebola virus may have adapted to 
humans during the West African epi- 
demic that occurred from 2013 to 2016. 

As researchers describe in two papers 
published in the 3 November issue of Cell, 
a mutation in one amino acid of the Ebola 
virus made it easier for the virus to infect 
human cells. The papers, which looked at 
more than 1000 genetic sequences from 
West African isolates of the virus, are 
careful not to overstate the meaning of the 
finding: One group writes that they were 
“left with a strong suspicion” that a human 
adaptation occurred, and they further noted 
that “it remains an open question whether 
this change led not only to increased 
replication in human hosts, but also to 
increased transmission between humans.” 
More than 28,000 people became infected 
with Ebola virus during the epidemic, by 
far the most human-to-human spread that 
has ever occurred. Although a key driver 
of the explosive spread likely was the fact 
that the virus for the first time made it 
to densely populated cities that had poor 
health infrastructures, these findings do 
raise the possibility that evolutionary 
forces were at play, too. 


First dinosaur brain fossil found 


A decade ago, a fossil hunter was combing 
the beach in southeastern United Kingdom 
when he found a strange, brown pebble. 
The surface of it caught his eye: It was 
smooth and strangely undulating, and 
also slightly crinkly in some places. That 
oddly textured pebble, scientists reported 
last week at the Society for Vertebrate 
Paleontology meeting, is actually an 
endocast, an impression preserved in the 
rock, that represents the first known 
evidence of fossilized brain tissue of a dinosaur 
(likely a close relative of Iguanodon, a large, 
herbivorous type of dinosaur that lived 
about 133 million years ago). The structure 
of the brain, studied with scanning electron 
microscopes, reveals similarities to both 
birds and crocodiles. It’s the first known 
evidence of such a dinosaur brain—but now 
that they know to look for it, the research- 
ers say, they might go back and look at 
other endocranial casts of dinosaurs to see 
whether they might contain traces of other 
such structures. http://bit.ly/dinobrains 


sciencemag.org SCIENCE 


IMAGE: SISSA 


Downloaded from http://science.sciencemag.org/ on November 10, 2016 


CREDITS: (PHOTO) PAUL NICKLEN, NATIONAL GEOGRAPHIC CREATIVE; (GRAPHIC) J. YOU/SCIENCE; (DATA) WORLD BANK 


CLIMATE CHANGE 


One metric ton of CO2 costs the Arctic 
3 square meters of summer sea ice. 


Sea ice shrinks in step with carbon emissions 


Models may underestimate the pace of ice loss because they are missing Arctic warming 


By Warren Cornwall 


The jet fuel you burned on that flight 
from New York City to London? Say 
goodbye to 1 square meter of Arctic 
sea ice. 
Since at least the 1960s, the shrink- 
age of the ice cap over the Arctic 
Ocean has advanced in lockstep with the 
amount of greenhouse gases humans have 
sent into the atmosphere, according to a 
study published this week in Science. Ev- 
ery additional metric ton of carbon dioxide 
(CO2) puffed into the atmosphere appears to 
cost the Arctic another 3 square meters of 
summer sea ice—a simple and direct obser- 
vational link that has been sitting in data 
beneath scientists’ noses. “It’s really basic,” 
says co-author Dirk Notz, a sea ice 
expert at the Max Planck Institute 
for Meteorology in Hamburg, Ger- 
many. “In retrospect, it sounds like 
something someone should have 
done 20 years ago.” 

If both the linear relationship 
and current emission trends hold 
into the future, the study sug- 
gests the Arctic will be ice free by 
2045—far sooner than some cli- 
mate models predict. The study 
suggests that those models are un- 
derestimating how warm the Arc- 
tic has already become and how 
fast that melting will proceed. And 
it gives the public and policymak- 
ers a concrete illustration of the 
consequences of burning fossil fuels, says 
Edward Maibach, director of the Center for 
Climate Change Communication at George 
Mason University in Fairfax, Virginia. “Con- 
crete information is always more engaging 
than abstract information,” he says. 
According to the new calculations, for 
instance, the average annual carbon emis- 
sions from a U.S. family of four would claim 
nearly 200 square meters of sea ice. Over 
3 decades, that family would be responsible 
for destroying more than an American foot- 
ball field’s worth of ice—a tangible threat 
to ice-dependent creatures such as polar 
bears. The study also makes for vivid 
comparisons between nations: Each person in 
the United States, for instance, is responsible 
for the destruction of 10 times as much 
ice each year as someone in India (see 
graphic, below). 

[Graphic: On thin ice. Based on CO2 emissions in 2013, each U.S. resident led to the melting of 49 
square meters of Arctic sea ice, nearly 10 times as much as someone in India. 
Axis: sea ice melted per person per year (m²).] 
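
The per-person and per-family figures above follow from multiplying emissions by the study's 3 square meters of ice per metric ton of CO2. The sketch below reproduces that arithmetic. Two inputs are assumptions rather than statements from the article: the U.S. per-capita emission value is back-calculated from the 49 square meters quoted in the graphic, and the football field area uses 360 by 160 feet (end zones included).

```python
ICE_LOSS_M2_PER_TON = 3.0  # from the study: m^2 of summer sea ice per metric ton of CO2


def ice_loss_m2(tons_co2):
    """Sea ice lost (m^2) for a given amount of emitted CO2 (metric tons)."""
    return ICE_LOSS_M2_PER_TON * tons_co2


# Assumption: per-capita emissions back-calculated from the graphic's 49 m^2.
us_per_capita_tons = 49.0 / ICE_LOSS_M2_PER_TON

family_of_four_per_year = 4 * ice_loss_m2(us_per_capita_tons)
family_30_years = 30 * family_of_four_per_year

# Assumption: American football field of 360 ft x 160 ft, end zones included.
FOOT_IN_METERS = 0.3048
football_field_m2 = (360 * FOOT_IN_METERS) * (160 * FOOT_IN_METERS)

print(f"Family of four, per year: {family_of_four_per_year:.0f} m^2")
print(f"Family of four, 30 years: {family_30_years:.0f} m^2")
print(f"Football field:           {football_field_m2:.0f} m^2")
```

Under these assumptions the family of four accounts for about 196 square meters a year, which matches "nearly 200," and the 30-year total exceeds the area of one football field.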

Sea ice retreat is already a poster child 
for global warming, and some earlier stud- 
ies had shown that the retreat closely tracks 
atmospheric CO2 levels. But Notz and co-
author Julienne Stroeve, an expert in sea 
ice satellite measurements at the National 
Snow and Ice Data Center in Boulder, Colo- 
rado, identified a tighter link to human ac- 
tivity by compiling annual human-caused 
greenhouse gas emissions and comparing 
those numbers with historic observations of 
Arctic sea ice coverage during September— 
when sea ice is at its smallest. To avoid 
problems with year-to-year ice fluctuations, 
they used a 30-year moving average of ice 
coverage, allowing them to study the years 
from 1968 to 2000. 
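
The smoothing step can be illustrated with a short sketch of a moving average. Only the 30-year window comes from the article; the 50-value series below is synthetic (a steady decline plus an alternating wiggle standing in for year-to-year fluctuations) and is not the study's data.

```python
def moving_average(values, window):
    """Return the mean of every run of `window` consecutive values."""
    if window <= 0 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    running = sum(values[:window])
    averages = [running / window]
    for i in range(window, len(values)):
        # Slide the window: add the newest value, drop the oldest.
        running += values[i] - values[i - window]
        averages.append(running / window)
    return averages


# Synthetic series (hypothetical): linear decline plus an alternating wiggle.
synthetic = [8.0 - 0.05 * i + (0.5 if i % 2 == 0 else -0.5) for i in range(50)]
smoothed = moving_average(synthetic, 30)

print(len(smoothed))                       # number of 30-value windows
print([round(x, 3) for x in smoothed[:3]])
```

In this toy series the wiggle cancels inside each 30-value window, so the smoothed values fall by the underlying trend alone. The cost is a shorter series: 50 values give only 21 averages, which is why smoothing restricts the range of years that can be studied.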

To see whether the linear 
relationship they found be- 
tween emissions and sea ice also 
emerged in computer simulations, 
they checked 36 of the world’s ma- 
jor climate models. In simulations 
in which CO2 levels rose 1% every 
year, they found the same telltale 
pattern every time, Stroeve says. 
Yet the models’ sensitivity was off: 
They tended to underestimate the 
amount of ice loss. “Models are 
not perfect,” Stroeve says. “And if 
you can use observations by them- 
selves to forecast when Arctic ice 
will go away maybe that’s in some 
ways better.” 
By tracing the trajectory of their observations 
into the future, Stroeve and Notz 
estimate that another 1000 gigatons of 
CO2 would push summer sea ice coverage 
to below 1 million square kilometers, 
an area the size of Texas and New Mexico 
combined. That’s essentially ice free, be- 
cause the remaining ice would be tucked 
into pockets around places like the north- 
ern edge of Greenland, where winds con- 
centrate ice. If today’s carbon emissions of 
35 gigatons per year persist, that translates 
to an ice-free ocean, at least in summer, by 
about 2045. Scientists say the same 1000 
gigatons that would erase sea ice would 
also warm the world by 2°C—the threshold 
that the Paris climate agreement intends to 
stay below. 
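
The 2045 estimate is a simple division of the remaining 1000 gigatons by the current emission rate. A minimal sketch of that arithmetic follows; the base year of 2016 is an assumption taken from this issue's publication date, and constant emissions and a strictly linear ice response are the article's stated "if" conditions, not certainties.

```python
M2_PER_TON = 3.0          # from the study: m^2 of summer sea ice per metric ton of CO2
TONS_PER_GIGATON = 1e9
M2_PER_KM2 = 1e6


def years_until_budget_spent(budget_gt, emissions_gt_per_year):
    """Years needed to emit `budget_gt` gigatons at a constant yearly rate."""
    return budget_gt / emissions_gt_per_year


def ice_lost_km2(co2_gt):
    """Summer sea ice lost (km^2) for a given amount of CO2 (gigatons)."""
    return co2_gt * TONS_PER_GIGATON * M2_PER_TON / M2_PER_KM2


BASE_YEAR = 2016  # assumption: year of publication
years = years_until_budget_spent(1000, 35)
ice_free_year = BASE_YEAR + years
lost = ice_lost_km2(1000)

print(f"Years to emit 1000 Gt at 35 Gt/yr: {years:.1f}")
print(f"Approximate ice-free year:        {ice_free_year:.0f}")
print(f"Ice lost for 1000 Gt:             {lost:,.0f} km^2")
```

At 3 square meters per ton, 1000 gigatons corresponds to about 3 million square kilometers of September ice, and spending that budget at 35 gigatons a year takes just under 29 years.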

Stroeve and Notz aren’t certain why the 
shrinking of the sea ice tracks emissions 
so neatly. But they point to a simple po- 
tential explanation: As rising emissions 
warm the Arctic air, the ice retreats to more 
northerly latitudes, where there is less heat 
from incoming direct sunlight. Notz thinks 
that climate models tend to downplay the 
amount of sea ice lost to each ton of CO2 
because they’re underestimating how warm 
the Arctic is getting. Although that might 
seem easy to check, data for air tempera- 
ture over the Arctic sea ice is quite sparse, 
says Gregory Flato, a sea ice scientist at the 
Canadian Centre for Climate Modelling and 
Analysis in Victoria. 

Alexandra Jahn, a sea ice modeler at 
the University of Colorado in Boulder, says 
Stroeve and Notz’s mechanism “finally ex- 
plains this linear relationship that people 
have been seeing, but didn’t have a physi- 
cal explanation for.” But she thinks further 
studies are needed to rule out other factors 
that might influence sea ice decline, such 
as changes in cloud cover. Stroeve and 
Notz say the impact of clouds and ocean 
temperatures isn’t significant. 

They do acknowledge, however, that 
sea ice half a world away in the Antarctic 
doesn’t track emissions as faithfully. There, 
forces including wind and ocean tempera- 
tures have a stronger impact on sea ice be- 
havior, they say. 

Francois Massonnet, a sea ice modeler 
at the Barcelona Supercomputing Center 
in Spain, cautions against assuming that 
what happened in the recent past will 
last into the future. A curveball, such as a 
shift in the amount of CO2 the oceans can 
absorb, could upset the pattern by break- 
ing the correlation between emissions and 
Arctic warming. “It’s tempting, but it’s 
very dangerous to be too confident that 
we can extrapolate linear relationships,” he 
says. “We know that the Arctic climate is 
very nonlinear.” 
Ambitious web fundraising 
startup fails to meet big goals 


Hundreds of scientists paid fees to have their work featured 
on Benefunder, but very few received donations 


By Mark Harris 


It seemed like a promising deal. Pay up 
to $2000 to have your research project 
featured on a slick website aimed at affluent 
donors looking to support a good 
cause and get a tax break, and then 
watch the money roll in. But it hasn’t 
paid off for most of the 650 researchers 
who signed up with Benefunder, a 3-year-old 
firm based in San Diego, California, 
that once envisioned raising more than a 
billion dollars annually for research. Just 
six researchers have received donations totaling 
some $250,000 through Benefunder, 
Science has learned. The 
startup has had to lay 
off most of its employees 
and is struggling to revamp 
its business model. 

Benefunder’s stumble 
represents a cautionary 
tale from the promising 
but sometimes perilous 
frontier of internet fundraising. 
Over the past 
decade, so-called crowdfunding 
websites, including 
Kickstarter, Indiegogo, and GoFundMe, 
have enabled moviemakers, entrepreneurs, 
and others to raise sometimes vast sums 
from small donors around the world. A few 
sites have specialized in raising money for 
scientists, with varying success. Experiment 
has raised nearly $7 million for more 
than 550 researchers since it began 5 years 
ago, with nearly half of proposed projects 
meeting funding goals. But another site, 
Petridish, quietly closed its doors in 2014 
after funding just a few dozen experiments 
over 3 years. 

Benefunder, the brainchild of entrepreneur 
Christian Braemer and Gert 
Lanckriet, a machine learning professor 
at the University of California, San Diego, 
aimed to become a science funding star. 
Instead of seeking donations from rank-and-file 
web users, it targeted affluent contributors 
and the organizations they use 
to make charitable donations. Benefunder 
even set up a financial mechanism, known 
as a donor advised fund, to ease a donor’s 
ability to claim U.S. tax breaks. By taking 
a 10% cut of the donations, Benefunder 
expected to earn some $1 million in 2014, 
$13 million in 2015, and a staggering $165 
million in 2016, representing more than 
$1 billion in donations, according to forms 
Braemer filed with the state of California. 

Benefunder’s vision appealed to hundreds 
of scientists, many from top universities. 
Each paid about $500 to have their 
work profiled on the website, and some 
spent up to $1500 more for a short video 
produced by the firm. “The idea of targeting 
small investors and exploiting a tax 
break strategy felt very innovative,” says 
Francois Baneyx, a chemical engineer at 
the University of Washington 
in Seattle, who 
signed up in 2014. “I had 
reasonable expectations, 
perhaps getting $10,000 
to $50,000.” 

But even as Benefunder 
bulged with 
projects, donors remained 
scarce. “We were 
never able to get off the 
ground,” Braemer says. 
Donor funds “were not 
willing to take the reputational risk [on] 
an unknown entity,” he says. And the firm 
received just a few “small transactions ... a 
bit out of the blue.” 

To stay afloat, Benefunder ramped up 
sales of the profiles and videos. In 2014 and 
2015, it earned more than $660,000 this 
way but attracted just $62,000 in gifts, tax 
forms show. In late 2015, as the firm ran 
out of cash, it abruptly stopped recruiting 
researchers, left some videos unfinished, 
and laid off all but three of the 12 employ- 
ees who worked for it and an allied firm. 

Benefunder is now “licking its wounds,” 
Braemer says, and with the help of new 
investors has “pivoted” to advising large 
wealth management firms on how to match 
their donors with its researchers. It’s not 
clear how that strategy will fare. But the 
firm, Braemer says, no longer expects “any- 
one to go to the website and actually make 
a donation.” 


Mark Harris is a writer in 
Seattle, Washington. 
AIDS EPIDEMIC 


Controversial HIV vaccine 
strategy gets a second chance 


Modest success in Thailand inspires South Africa trial 


By Jon Cohen 


A two-pronged HIV vaccine strategy 
that delivered lackluster results in 
a trial in Thailand 7 years ago will 
get another chance in South Africa. 
Last week, researchers injected the 
first of what they hope will be 5400 
healthy men and women between 18 and 
35; each will receive either a combina- 
tion of two vaccines or placebo shots. The 
$130 million study should show once and 
for all whether the combination—the only 
HIV vaccine so far to show any hints of ben- 
efit in an efficacy study—actually works. 

Some researchers say the results in Thai- 
land were so unimpressive that the massive 
study, funded primarily by the National 
Institute of Allergy and Infectious Dis- 
eases (NIAID) and the Bill & Melinda Gates 
Foundation, amounts to a waste of research 
funds. “That money could be much better 
spent developing promising concepts that 
have a real chance of working,” says Ronald 
Desrosiers, an HIV vaccine researcher 
at the University of Miami in Florida. 
But Anthony Fauci, head of the NIAID in 
Bethesda, Maryland, says there was enough 
evidence of efficacy in the Thai trial to war- 
rant another test. “We’re just going to have 
to let it play out,” Fauci says. 

Trial leaders argue that even a modestly 
efficacious vaccine would help South Africa, 
which has more than 6 million HIV-infected 
people—19% of the world’s total. “There 
have been all these naysayers,” says Glenda 
Gray, who heads the South African Medical 
Research Council in Cape Town and will 
lead the new study. “Do they not care? It 
means they’re completely out of touch with 
the epidemic and that HIV continues to rav- 
age Africa.” 

The Thai trial, which involved more than 
16,000 people, found that the vaccine re- 
duced the risk of HIV infection by a modest 
31.2%. Volunteers first received a canary- 
pox virus engineered to deliver several HIV 
genes. This was followed by a shot that only 
contained HIV’s surface protein, gp120, plus 
alum, an immune stimulant called an adju- 
vant. Both vaccines were designed to pro- 
tect against strains of HIV in circulation in 
Thailand; the dual approach was meant to 
stimulate different immune responses. 

For the new study, researchers built simi- 
lar vaccines with strains common in South 
Africa but chose a stronger adjuvant called 
MF59. Participants will also receive an extra 
injection of the gp120 vaccine in an attempt 
to make the protection more durable. 

Critics say the problem isn’t just the low 
level of protection seen in Thailand. The 
study also failed to show a mechanism that 
could explain the modest benefit, they argue. 
Researchers studying stored blood 
samples from the Thai study found that 
vaccinated people who did not become infected 
had higher levels of antibodies that 
bind to gp120. But in test tube experiments, 
the same binding antibodies do not prevent 
the virus from infecting cells, which 
some researchers insist is critical for protection. 
“Fundamentally, there’s no logical 
pathway from what we know to this vaccine 
trial,” says virologist Paul Bieniasz of the 
Aaron Diamond AIDS Research Center in 
New York City. “There should be more intelligent 
debate about what moves forward in 
the pipeline.” 

South Africa’s high HIV infection rate makes it an ideal 
place to stage a vaccine trial. Many people becoming 
infected are young, like this woman hospitalized with 
AIDS earlier this year in KwaZulu-Natal. 

Bieniasz and others think vaccine re- 
searchers should focus on other approaches, 
like the rare antibodies that have unusual 
power to neutralize HIV in the test tube 
and work against a huge range of mutant 
strains—crucial for a vaccine against the con- 
stantly mutating virus. Many groups are at- 
tempting to develop vaccines that can teach 
the body how to make these broadly neutral- 
izing antibodies, but progress has been slow. 

Genoveffa Franchini, a vaccine re- 
searcher at the U.S. National Cancer In- 
stitute in Bethesda, has another concern 
about the trial: the new adjuvant. In May, 
Franchini and co-workers reported in Na- 
ture Medicine that the vaccine duo works in 
rhesus macaques when gp120 is combined 
with alum, but not with MF59. “If MF59 is 
doing the same thing that it’s doing in the 
macaques, then you’re going to end up having
no protection and you won’t know why,”
Franchini said at a recent HIV/AIDS confer- 
ence at Cold Spring Harbor Laboratory in 
New York. She suggested MF59 might even 
overstimulate the immune system and cre- 
ate more target cells for the virus to infect, 
leaving vaccinated people more vulnerable 
to infection. “There’s this oath that we take 
when we finish up medical school: Don’t 
harm,” she cautioned. 

But MF59 did work in another mon- 
key study, said Lawrence Corey of the 
Fred Hutchinson Cancer Research Cen- 
ter in Seattle, Washington, who heads the 
NIAID-funded network overseeing the 
study. “People of good faith will have differ- 
ent answers,” he said. 

Gray, the study leader, thinks Franchini 
makes a “compelling” case to add an alum 
arm to the trial—if the researchers can find 
the money. But she says the South African 
study is well designed as is, with many 
checkpoints at which it can be halted if the 
vaccine is harmful or even if it becomes 
clear that vaccinated people are getting 
infected at the same rate as those in the 
placebo group. “Someone has to put their 
stake in the ground and have the courage 
to move forward, knowing we might fail,” 
Gray says.

‘Four-legged snake’ may be ancient lizard instead


Controversy swirls around fossil, which is no longer accessible 


By Carolyn Gramling, in Salt Lake City 


It is a tiny, fragile thing: a squashed skull
barely a centimeter in length; a sinuous
curving body about two fingers long; four
delicate limbs with grasping hands. In a
major paper last year, researchers called
this rare fossil from more than 100 mil- 
lion years ago the first known four-legged 
snake. But at a meeting of the Society of Ver- 
tebrate Paleontology (SVP) here last week, 
another team suggested that it’s a marine 
lizard instead. Even as scientists debate the 
identity of this controversial specimen, the 
only one of its kind, it appears to be inacces- 
sible for further study. And paleontologists 
are mad as hell. 

“It’s horrifying,” says Jacques Gauthier, a 
paleontologist at Yale University. As far as 
he’s concerned, if the fossil can’t be studied, 
it doesn’t exist. “For me, the take-home mes- 
sage is that I don’t want to mention the name 
Tetrapodophis ever again.” 

A year ago, researchers led by David 
Martill of the University of Portsmouth in 
the United Kingdom reported in Science 
that the fossil, which they named Tetra- 
podophis amplectus (for four-footed snake), 
was a missing link in the snake evolutionary
tree (24 July 2015, p. 416). Researchers
knew snakes had evolved from four-limbed 
reptiles, but few transitional forms had 
been discovered, and researchers continue 
to wrangle over whether the first lizards to 
lose their limbs and become snakes were 
terrestrial burrowers or aquatic swimmers. 

Martill and colleagues reported that the 
fossil, which they described as a specimen
in a German museum, originated from a
Brazilian outcrop of the Crato Formation, 
a 108-million-year-old limestone layer rich 
in both marine and terrestrial species. They 
identified snakelike features in the fossil, 
including a long body consisting of more 
than 150 vertebrae, a relatively short tail of 
112 vertebrae, hooked teeth, and scales on 
its belly. Those features, they say, support 
the hypothesis that snakes evolved from 
burrowing ancestors. 


Published by AAAS 


One side of a slab holds the fossil’s 
body and a mold of its skull (top); 
the “counterpart” has the skull and 
a mold of the body. 


But many paleontologists weren’t 
convinced. Last week at the 
meeting, paleontologist Michael 
Caldwell of the University of 
Alberta in Edmonton, Canada, 
and colleagues presented their 
own observations of the specimen, 
rebutting Martill’s paper point 
by point to a standing-room-only 
crowd. 

The new analysis hinges on the 
“counterpart” to the original fos- 
sil, which was also housed in the 
Bürgermeister-Müller Museum in
Solnhofen, Germany. When the 
slab of rock containing the fos- 
sil was cracked open, the body of 
the organism stayed mostly in 
one half of the slab, whereas the 
skull was mostly in the other half, 
paired with a mold or impression 
of the body. This counterpart slab, 
Caldwell says, preserved clearer de- 
tails of the skull in particular. In his 
group’s analysis of the counterpart, 
he says, “every single character 
that was identified in the original 
manuscript as being diagnostic of 
a snake was either not the case or 
not observable.” 

For example, in snake skulls, a bone 
called the quadrate is elongated, which 
allows snakes to open their jaws very 
wide. This fossil’s quadrate bone is more 
C-shaped, and it surrounds the animal’s 
hearing apparatus—a “characteristic fea- 
ture” of a group of lizards called squamates, 
says co-author Robert Reisz, a paleonto- 
logist at the University of Toronto in Mis- 
sissauga, Canada. He and Caldwell add that 
although the fossil has more vertebrae in 
its body than in its tail, the tail isn’t short, 
but longer than that of many living lizards. 
They are working on a paper arguing that 
the fossil is probably a dolichosaur, an ex- 
tinct genus of marine lizard. 

Martill and co-author Nicholas Longrich 
of the University of Bath in the United King- 
dom, neither of whom was at the meeting, 
stand firmly behind their original analysis. 
Longrich cites all the snakelike features dis- 
cussed in the original paper. “In virtually 
every single respect [it] looks like a snake, 
except for one little detail—it has arms and 
legs,” he told Science by email. 

Many researchers who attended the talk, 
including Gauthier and paleontologist
Jason Head of the University of Cambridge 
in the United Kingdom, are persuaded that 
Tetrapodophis is not a snake. But as for
what it is, there may be as many opinions 
as there are paleontologists. Hong-yu Yi of 
the Institute of Vertebrate Paleontology and 
Paleoanthropology in Beijing, China, says 
she’s reserving judgment on the specimen’s 
identity until further analysis. “I was always 
waiting for a longer description of the speci- 
men. I’m still waiting,” she says. 

That analysis may never happen. Caldwell 
says he went back to the Bürgermeister-Müller
Museum several months ago to
study the specimen again; separately, Head 
says he also attempted to study the fossil. 
Neither could get access to it. Caldwell says 
that the fossil wasn’t actually part of the 
museum’s collection, but was on loan from 
a private owner. Researchers who declined 
to be named because of ongoing discussions 
around the fossil say that it may have been 
damaged during study, prompting the col- 
lector to restrict access to it. “I don’t even 
know if a publication at this moment is ap- 
propriate because no one else will be able to 
access this specimen,” Yi says. 

In fact, some researchers question 
whether the original paper should have 
been published, because the fossil was not 
officially deposited in a museum or other 
repository, so the authors couldn’t guaran- 
tee that future researchers could access it. 
“I have nothing against” private fossil collecting,
Gauthier says. But when a fossil enters
the scientific literature, he says, “then it
has to be available. Science requires
repeatability.” In response, Science Deputy Editor
Andrew Sugden in Washington, D.C., says, 
“Our understanding at the time of publica- 
tion and in subsequent correspondence was 
that the specimen was accessible at the mu- 
seum, as stated at the end of the paper.” 

Researchers had raised other questions 
about the fossil’s transport out of Brazil. 
Brazil passed laws in the 1940s making 
all fossils property of the state rather than 
private owners. “Most of the exploration of 
the limestone quarries in that region of the 
country began in the second half of the 20th 
century,” says Tiago Simões, a paleontologist
at the University of Alberta, who was
also an author on the SVP talk. “So the 
vast majority” of fossils from those areas 
were collected after the law had passed. 
“That really touches on some very sensitive 
ethical boundaries.” 

Head agrees. “The best way to move for- 
ward is to literally erase the specimen from 
our research program. Tetrapodophis is no 
longer science. ... It’s not repeatable, it’s 
not testable. If any good can come out of 
Tetrapodophis, it’s the recognition that we 
have got to maintain scientific standards 
when it comes to fossils ... they have to 
be accessible.” 


SCIENCE sciencemag.org 




BEHIND THE NUMBERS 


California rules U.S. corporate research 


By Jeffrey Mervis 


Every state wants to attract the next
Alphabet (Google’s parent company).
Such high-tech firms not only provide
high-paying jobs, but also help boost
the research capacity of its universities.
But the latest data from the National
Science Foundation (NSF) show that
most state economic development officials 
have their work cut out for them: Three 
of every 10 corporate research dollars are 
now spent in California, and over the past 
5 years the Golden State has widened its 
lead over the rest of the country. 

The data come from NSF’s annual Busi- 
ness R&D and Innovation Survey (BRDIS). 
The survey documents a $265 billion en- 
terprise in which some 500 companies ac- 
count for 80% of the total spending. The 
survey finds that California was home to 
$77 billion in corporate research expendi- 
tures in 2013, thanks to companies such as 
Google, Facebook, and Lockheed Martin. 
That total equals the combined spending 
in the next five states: Michigan, Massa- 
chusetts, Washington, Texas, and Illinois. 
Together, the top six states account for 
nearly 57% of the $255 billion in corpo- 
rate research that NSF was able to link 
to a particular location. Yet those states 
comprise just 37% of the U.S. population 
and generate 35% of the country’s gross 
domestic product. 
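
The "three of every 10 dollars" figure can be checked from the numbers quoted in the text. The short Python sketch below does only that arithmetic; the variable names are illustrative, and the California value is the 2013 figure from the accompanying chart ($76.9 billion, which the text rounds to $77 billion).

```python
# Figures quoted in the article, in billions of U.S. dollars (2013).
california = 76.9       # California's corporate research spending (chart value)
located_total = 255.0   # spending NSF could link to a particular location
survey_total = 265.0    # all spending documented by the survey

# California's share under each choice of denominator.
share_of_located = california / located_total
share_of_total = california / survey_total

print(f"{share_of_located:.1%} of spending tied to a location")
print(f"{share_of_total:.1%} of all surveyed spending")
```

Either denominator gives a share close to 30%, which is consistent with the article's summary.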

The report also shows that California’s 
corporate research economy weathered the 
Great Recession better than did the econo- 
mies of other states. California’s research 
output grew 42% between 2008 and 2013, 
according to the survey, compared with 
a 7% increase nationwide. That healthy 
performance boosted California’s share of 
the U.S. total from 25% in 2008 to 30% in 
2013, a figure that is well above California’s 
roughly 13% share of the country’s popula- 
tion and the national GDP. 

The Golden State’s increasing dominance 
of corporate research was fueled by a boom 
in information and communications tech- 
nology services, which grew faster than any 
other industrial research sector in the past 
5 years. California also has benefited from 
the fact that software research is no longer 
done exclusively by information technol- 
ogy (IT) companies like Microsoft, based in 
Seattle, Washington. Overall, software re- 
search, for the first time, accounts for the 
majority of the $121 billion spent on all IT 
research, which includes research done by
semiconductor manufacturers. 

The 2008 financial crisis was hard on 
corporate research in states dominated by 
a single sector. For example, downsizing by 
pharmaceutical companies dropped New 
Jersey from second to seventh place in 
industrial research activity. A rebound in 
car sales allowed Michigan, where the au- 
tomotive industry accounts for 74% of the 
state’s overall research activity, to remain a 
distant second behind California. 

The BRDIS also zooms in on research 
activity at the level of metropolitan areas. 
California is home to the top two corri- 
dors: the San Jose-San Francisco-Oakland 
and the Los Angeles-Long Beach regions. 


A big lead 


California is ahead of the pack when it comes to 
corporate research spending ($ billions, 2013). 


California      76.9
Michigan        14.4
Massachusetts   14.0
Washington      14.0
Texas           13.4


Together, research-intensive companies 
(those spending more than $3 million 
annually) reported performing 35% of 
their research in those two urban corri- 
dors. Washington state’s Seattle-Tacoma- 
Olympia corridor placed third, slightly 
ahead of Michigan’s Detroit-Warren-Ann 
Arbor complex and the Boston-Worcester- 
Providence axis stretching from Massachu- 
setts to Rhode Island. 

The report is a sobering reminder that 
the ongoing competition among states 
to land new research facilities can be a 
winner-take-all struggle. Some 52% of the 
companies with the largest research port- 
folios report doing all of that work at only 
one site (typically their headquarters), 
and 69% perform all of their research at 
just two locations. That reality suggests 
state officials who are California dreamin’ 
might be better off nurturing an in-state 
startup—perhaps a university spinoff— 
rather than trying to lure a high-tech giant 
from another state. 

Rudolf Jaenisch made the
first transgenic mouse in 
1974 and has pioneered the 
engineering of CRISPR mice. 

MICE MADE EASY


The genome-editing tool CRISPR upends the 
vital business of creating mutant mice 


In the beginning of 2013, Michael Wiles
sat down with high-level managers of 
the Jackson Laboratory in Bar Harbor, 
Maine, and told them about a novel 
way to cut DNA that had amazing 
power. The lab, called JAX for short, 
genetically engineers mice that it sells 
to researchers under a trademarked 
brand: JAX® Mice, it likes to boast, 
“are the highest quality and most-published 
mouse models in the world.” Wiles evalu- 
ates and develops technologies for the lab, 
and he was convinced that this new tool, 
ingeniously adapted from an immune strat- 
egy that bacteria and archaea use to protect 
themselves from viruses, would revolution- 
ize the way JAX engineered mice. “Of about 
a dozen people, nine were asleep,” Wiles
says. “No one had heard of CRISPR.” 

Now, most every mouse developer has. 
JAX and other labs making new mouse 
strains have long relied on a laborious 
multistep process that involves genetically 
altering mouse embryonic stem (ES) cells, 
injecting them into an embryo, and breed- 
ing multiple generations of animals. Even 
JAX’s crack team took up to 2 years to engi- 
neer a mouse. CRISPR replaces all that with 
a molecular complex that can do targeted 
genetic surgery on a fertilized egg. It can 
produce a strain of transformed mice in 
6 months. “It’s night and day,” Wiles says.
“We had five or six people working with ES 
cells. They were close friends of mine and I 
said, ‘You better look for a job.’”

Mice genetically modified to cripple or 
“knock out” genes or to add or “knock in” 
genetic information have become key re- 
search models for a wide array of human 
diseases, from cancer and atherosclerosis 
to Alzheimer’s, osteoarthritis, muscular 
dystrophy, and Parkinson’s. Knockout and 
knockin mice also offer a powerful tool for 
probing the functions of specific genes. 

Most investigators get their engineered 
mice from colleagues or by purchasing 
them from commercial outfits like JAX
or academic-based repositories. Popular 
engineered mice, such as JAX’s immuno- 
deficient NOD scid gamma strains, sell for 
as little as a few hundred dollars, but a 
custom-made mutant could cost as much as
$20,000. By making the engineering of mice 
far simpler and cheaper, CRISPR opens 
the way for more labs to do it themselves. 
“When you made knockout mice before, you 
needed some skills,” says Rudolf Jaenisch at 
the Massachusetts Institute of Technology 
(MIT) in Cambridge. “Now, you don’t need 
them anymore. Any idiot can do it.” 

If CRISPR’s talent with rodents is shaking 
up individual labs, it is causing an earthquake 
in an international consortium to knock out 
all 21,000 mouse genes, one by one, in order 
to reveal their functions. The consortium,
which includes JAX, has spent $350 million
to date and is about a quarter of the way to its 
goal. Many investigators hope the speed and 
cost savings of CRISPR will accelerate prog- 
ress. The National Institutes of Health (NIH), 
for one, is so impressed with CRISPR’s ease 
and power that it no longer funds consortium 
investigators to use ES cells. 

But that’s where some mouse engineers 
have second thoughts about the rush to 
CRISPR. Few doubt its potential, but the 
technique is still a work in progress, and its 
ability to alter genomes has one big gap. Al- 
though CRISPR knocks out genes with ease, 
it is less efficient at inserting, or knocking 
in, new DNA. That’s important not just for 
giving an animal a novel function, but also 
for creating a knockin known as a “conditional”
knockout, an animal model in which
researchers can turn off a target gene at 
specific times of life or in specific tissues. 

Because CRISPR is less adept at making 
conditional knockouts, William Skarnes, 
who led a team making mutant mouse ES 
cells at the Wellcome Trust Sanger Institute 
in Hinxton, U.K., worries that NIH is over- 
emphasizing the new approach. “The deci- 
sion to abandon the ES resource in favor 
of making simple knockouts is a mistake,” 
Skarnes says. “You still want to make condi- 
tionals through the ES route.” 

CRISPR researchers are now refining the 
technique to do knockins with greater ef- 
ficiency. But that entails tinkering with the 
mechanisms that cells use to repair broken 
DNA, which are critical to their health. “I’m 
cautious about overmanipulation of biology
to increase efficiency,” says Steve Murray,
who helps run JAX’s contribution to the 
Knockout Mouse Phenotyping Program 
(KOMP2), which is part of the international 
consortium. “We're waiting in the wings for 
the wizards in the field to help us with this.” 


CRISPR STANDS FOR “clustered regularly 
interspaced short palindromic repeats,” 
which is a description of the prokaryotic 
genetic material from which it was derived. 
It uses what’s called a guide RNA to send 
biological scissors—usually the CRISPR- 
associated protein, Cas9—to a precise spot in 
a genome. Once Cas9 enzymatically makes 
the cut, the cell tries to heal the wounded 
DNA. One repair mechanism leads to knock- 
outs, whereas the second leads to knockins. 
“All CRISPR does is cut the DNA,” Wiles says.
“Everything else is the cell repair system, and 
that’s what we're hitching on to.” 

The cell’s standard response is to try 
to paste the double-stranded DNA back 
together at the break points. This often
requires eating away or adding a few
bases (the As, Cs, Ts, and Gs that make up
DNA), which leads to insertions or deletions.
In effect, the repair effort introduces
typos into the DNA text, disabling the gene.

See how CRISPR runs

Using the genome-engineering tool to alter a fertilized egg is a quicker and more efficient way to engineer
a mouse than the traditional route, which starts by modifying an embryonic stem cell.

MIT's Jaenisch was the first to show the 
power of CRISPR for producing mouse 
knockouts. In a 2 May 2013 paper in Cell that 
appeared 5 months after researchers first 
showed CRISPR could work in mammalian 
cells, he and co-workers reported that the 


540 4 NOVEMBER 2016 + VOL 354 ISSUE 6312 


technique successfully disrupted five genes 
in a single set of mouse ES cells, something 
that was not possible before. More impor- 
tant, they showed that they could bypass ES 
cells altogether and simultaneously knock 
out two genes in single-celled mouse zy- 
gotes, or fertilized eggs. No longer would 
researchers have to modify ES cells and 
painstakingly breed several generations of
mice to produce an animal that carried the 
mutant gene in its egg or sperm cells. And 
researchers who wanted mice with two mu- 
tations would no longer have to interbreed 
single mutants and go through a similarly 
time-consuming, cumbersome process to 
arrive at progeny with the altered germ line. 
As the title of Jaenisch’s paper declared tri- 
umphantly, “One-step generation of mice 
carrying mutations in multiple genes.” 

Since then, more than 500 papers have 
detailed how CRISPR can both knock out 
and knock in genes in mice. “The impact 
it’s had is enormous,” says Jaenisch, who
in 1974 created the first transgenic mouse. 
“It’s really changed the time and efficiency
of getting these engineered animals,” adds
biochemist Tak Mak of the University of To- 
ronto in Canada, who was also a pioneer in 
the mouse-mutating business. Mak estimates 
it’s about 30% cheaper to engineer a mouse 
with CRISPR than with ES cells, bringing his 
average cost down to about $100,000. 

CRISPR’s impact is measured in more 
than savings. The ease and speed of the 
technique makes it possible to engineer 
mice on the fly, to solve specific puzzles like 
one that C. C. Hui of the Hospital for Sick 
Children in Toronto recently confronted: a 
knockout in which the missing gene didn’t 
have any observable effect. Hui realized 
that the knocked-out gene was linked to 
another gene that might be compensating 
for it. He took the problem to Lauryl Nutter, 
who oversees mouse making at the Centre 
for Phenogenomics in Toronto. She used 
CRISPR to mutate the offending gene in a 
zygote from the original knockout. “We got 
the zygote, injected it with CRISPR-Cas9, 
and 8 weeks later he had a double mutant 
on the ground,” Nutter says. “That would 
have taken years with ES cells.” 

The revolution is not limited to making 
mice with germline mutations. CRISPR 
has allowed investigators to mutate several 
suspected cancer genes simultaneously in 
the somatic cells of adult mice, for exam- 
ple. CRISPR knockins have also corrected 
disease-causing gene defects in adult mice, 
such as the mutations that cause hemo- 
philia and sickle cell anemia. And several 
groups plan to inject CRISPR into a devel- 
oping mouse; the goal is to create mutations 
that act as barcodes and allow scientists to 
track cell lineages as they differentiate. 

Mousemaking outfits like the Centre 
for Phenogenomics and JAX expect that 
CRISPR will vastly expand the range of mu- 
tants they produce. “Now I can take a really 
exotic mouse that has three genetic modi- 
fications and modify it again,” JAX’s Wiles 
says. “We couldn’t do sequential modifica- 
tion with ES cells. We could breed a mouse 

A reporter does CRISPR

By Jon Cohen


I speak biology fluently, but the molecular complexities of the
novel genome-editing tool called CRISPR left me as befuddled
as when I peruse descriptions of the inflationary universe. So
I decided to test what one investigator told me: CRISPR (for
“clustered regularly interspaced short palindromic repeats”) 
may sound intimidating, but it is so simple to use that “any 
idiot” could do it. 

I would give it a try. 

CRISPR works best at crippling, or knocking out, genes, so 
that’s how I choose to use it. (For a video, see bit.ly/vid-6312.) But 
I aim high: I target an immune gene that, I theorize, could lead 
to insights into reducing the harm done by Zika virus. (My admit- 
tedly wild hypothesis is that the gene, CD32, may help drive Zika 
virus to copy itself to higher levels if a person was previously in- 
fected with dengue and has antibodies to that virus.) 

Roland Wagner, a postdoc in the lab of Sumit Chanda at the 
Sanford Burnham Prebys Medical Discovery Institute in San 
Diego, California, agreed to 
serve as my CRISPR sensei. An 
experienced rock climber origi- 
nally from Austria, Wagner 
approaches everything me- 
thodically. He pulls up the se- 
quence of the CD32 gene, which 
has five distinct protein-coding 
regions. If we cut the DNA in 
one region, the gene most likely 
would be knocked out: It would 
no longer make its protein. 

CRISPR uses a guide made 
of RNA to direct molecular 
scissors—part of the CRISPR- 
associated protein, or Cas9— 
to exact spots in a genome. 
We could buy the guide RNA 
(gRNA), but the idea appalls 
Wagner. “I would assume it’s 
probably $500 to buy the gRNA, but I wouldn’t know,” he says. 
“We're making our own and we're spending about $5.” 

The gRNA sequence must complement a stretch of 20 nucleo- 
tides on the segment of the CD32 gene we want to cut. But the 
same DNA sequence could recur elsewhere in the genome, lead- 
ing the molecular scissors to cut in the wrong place. Such “off- 
target” effects can cause mayhem, and eliminating them is a key 
goal of those honing their CRISPR skills. To make the match 
more specific, Cas9 requires an additional sequence flanking 
the targeted 20 nucleotides: N-G-G, in which “N” can be any 
nucleotide. Where Cas9 finds N-G-G immediately following 
the 20 nucleotides, it attaches to and opens the double helix, 
allowing the gRNA to bind. Cas9 then cuts each strand of 
the DNA. 
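
The targeting rule described above (20 nucleotides immediately followed by N-G-G) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the method used by Optimized CRISPR Design or any other real design tool: the function name `find_cas9_targets` and the toy sequence are hypothetical, only the strand as given is scanned, and near-matches elsewhere in a genome (the source of off-target cuts) are not scored.

```python
import re


def find_cas9_targets(dna):
    """Return (start, target, pam) for each 20-nt stretch followed by N-G-G.

    Only the strand as given is scanned; a fuller search would also scan
    the reverse complement and score near-matches elsewhere in the genome.
    """
    dna = dna.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{20})([ACGT]GG))")
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(dna)]


# Toy sequence (not the real CD32 gene): 20 nucleotides, then T-G-G, then filler.
toy = "ATGCATGCATGCATGCATGC" + "TGG" + "AAAA"
for start, target, pam in find_cas9_targets(toy):
    print(start, target, pam)
```

On the toy sequence the search reports a single candidate at position 0. A real gene segment would typically yield many candidates, which is why the uniqueness check against the rest of the genome matters.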

To homebrew our gRNA, Wagner copies the sequence of 
the CD32 segment we've identified and pastes it into a freely 
available database, Optimized CRISPR Design, that looks for a 
matching set of 20 nucleotides followed by N-G-G. There are 
41 options within CD32. The database scans the entire human 
genome to see whether there are identical matches elsewhere— 
potential sites of off-target cuts. We select a sequence that appears
unique, and then he goes to another website and orders a
stretch of DNA, an oligonucleotide, with that sequence.

Biologist Roland Wagner (left) watches as Jon Cohen attempts a key
pipetting step in creating a CRISPR construct.

The oligo arrives, and I lose my modern pipetting virginity. 
I have not worked in a lab since I was an undergraduate more 
than 30 years ago. Back then, I learned a pipetting technique 
that probably was invented by Louis Pasteur: I put a finger in 
my mouth and then sucked up a chemical into a thin glass tube, 
capping it with my fingertip when I had drawn up enough. 

Now, at Wagner’s lab bench, I face a rack of fancy plastic giz- 
mos that look like squirt guns but enable users to suck up pre- 
cise microliters of liquid with a push of a button. My task is to 
pipette the oligo from one tiny test tube into another. The sec- 
ond tube holds a plasmid, which is a circular piece of DNA that 
will act as a Trojan horse. This plasmid, customized for CRISPR 
experiments, already holds the gene for Cas9. It also contains a 
60-nucleotide “hairpin” sequence that ultimately will attach to 
the 20 nucleotides I add to make the full gRNA. 

I use one of the fancy pipettes to move the oligo into the 
plasmid tube, and I also add buffer, water, and an enzyme. If all 
goes well, the enzyme will cut 
open the plasmid, removing a 
piece of its DNA and allowing 
the oligo to take its place. 

All does not go well. 

“Oops!” Wagner says as I pi- 
pette the enzyme. “You failed a 
little bit.” I apparently hit the 
pipette button before submerg- 
ing the tip into the liquid. 

In the end, I manage the 
procedure. After waiting for 
the chemical reactions to take 
place, we take my CRISPR 
plasmid to an electrophoresis 
machine, a tray that has wires 
hooked to it. We add a liquid 
that quickly turns to gelatin, 
and then I pipette a few drops 
of my CRISPR plasmid into 
different lanes on the device. I flip a switch to apply an elec- 
tric current, which should separate the DNA into bands based 
on weight. The small piece of DNA I cut out with the enzyme 
should form a distinct band. 

My gel electrophoresis only has one band, from the plasmid. 

“It doesn’t look like it worked,” Wagner says gently. “I didn’t
want to be all picky, but it could be that you messed up the en- 
zyme with your pipetting.” He allows that when he was starting 
out, his experiments often failed. “I’d go home and I’d say, ‘I hate
my life,’” he confided. “There are a lot of setbacks in science.”

I’ve already learned that any idiot cannot do CRISPR: It 
takes, at least, basic laboratory skills. 

Wagner conducts the experiment in parallel with me, and his 
plasmid properly incorporates the oligo that will guide Cas9 to 
its target. We then coax a cell line made from an embryonic kid- 
ney into taking up the Trojan horse plasmid. After a few days, we 
isolate the DNA from the cells, amplify it with the polymerase 
chain reaction, and use electrophoresis to show that the CD32 
gene has been cut into pieces. Voila, our knockout worked. 

“You did great,” Wagner tells me. “Way to go!”

I did not do great. But CRISPR did its job. 
with two modifications with another that
had two modifications and the alleles scat- 
tered like the wind. It would take years to 
get all four modifications.” 

Wiles says this likely won’t affect JAX’s 
bottom line. “Instead of shipping thousands 
of boxes with one variety, we will have hun- 
dreds of boxes with tens of varieties.” 


ON ONE FRONT, however, the CRISPR revolu- 
tion is faltering. Three months after his lab’s 
first CRISPR report, Jaenisch and co-workers 
published a second paper in Cell that sug- 
gested CRISPR could easily perform more 
complex genetic surgery, knocking in chunks 
of DNA rather than simply disabling genes. 
As a demonstration, they used CRISPR to 
knock fluorescent tags into mouse zygotes,
which lighted up whenever a specific gene
was turned on. They also created conditional 
mutants, which are key to many research ef- 
forts, including the knockout consortium. 
Conditionals get around a barrier to mak- 
ing knockouts. About one-third of mouse 
genes are essential for embryonic growth; 
the mouse is never born if they are disabled 
from the start. So researchers working with 
ES cells cleverly designed a system called 
Cre-Lox recombination that knocks out 
genes only after the mouse has developed 
enough to survive their loss. It requires adding
extra DNA: Lox sequences flanking the
targeted gene plus a Cre gene, which can
be turned on to produce an enzyme that
modifies the DNA between the Lox sites.
Using CRISPR to insert this same system 
into zygotes, Jaenisch’s team reported making
conditional mice with relatively “high
efficiency”: about 16% of the zygotes led to
mouse pups with the correct mutations.

Skarnes is one of many researchers 
bowled over by Jaenisch’s initial reports, 
but he was disappointed when he tried to 
take the technique into his own lab. “It 
looked from his papers that this was going 
to be straightforward and I was quite confi- 
dent this would make ES obsolete,” Skarnes 
says. “What was disappointing is none of us 
could reproduce at the efficiencies reported 
by Jaenisch. ... It works at 1% or 2% at JAX 
and a lot of projects are failing. It’s really 
not proven to be a robust method.” 
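
To put these efficiencies in perspective, the minimal Python sketch below estimates how many zygotes would have to be injected to be 95% sure of obtaining at least one correctly modified pup. It assumes that each zygote is an independent trial with the same success probability, which is a simplification; the function name `zygotes_needed` and the 95% threshold are illustrative choices, not figures from the labs quoted here.

```python
import math


def zygotes_needed(success_rate: float, confidence: float = 0.95) -> int:
    """Smallest n such that P(at least one success in n independent tries) >= confidence."""
    if not 0.0 < success_rate < 1.0:
        raise ValueError("success_rate must be strictly between 0 and 1")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be strictly between 0 and 1")
    # P(no success in n tries) = (1 - p)^n, so solve (1 - p)^n <= 1 - confidence for n.
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - success_rate))


# Reported efficiency (16%) versus the 1% to 2% range Skarnes describes.
for rate in (0.16, 0.02, 0.01):
    print(f"{rate:.0%} efficiency: {zygotes_needed(rate)} zygotes")
```

Under these assumptions, dropping from 16% to 1% efficiency raises the required number of zygotes from roughly twenty to roughly three hundred, which is one way to see why projects that looked straightforward on paper fail in practice.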

There are several reasons why a CRISPR 
cut more readily leads to a knockout than a 
knockin. To create knockins with CRISPR,
researchers introduce stretches of “donor”
DNA (anything from a few bases to an entire
gene) designed to integrate at the break
points created by Cas9. Splicing in the do- 
nor DNA requires that its ends match, or 
are “homologous” with, the damaged DNA. 
A process of homology-directed repair
(HDR) then stitches the ends together. 

The knockout repair mechanism, which 
is called nonhomologous end joining, can 
happen at any stage in the cell division 
cycle and occurs quickly. HDR, in contrast, 
mainly happens in one phase of the cell 
cycle and is far slower. Some genes, such 
as those Jaenisch selected for his initial 
knockin experiments, are also more con- 
ducive to HDR than others. “The paper re- 
ported what we found,” Jaenisch says. “Now, 
we see there are issues.”

CRISPR works in every strain of mouse, whereas ES-cell
technology is mostly limited to one inbred strain. 


To others, CRISPR’s limitations raise 
questions about the knockout mouse con- 
sortium’s decision to abandon ES cell tech- 
nology. Launched in 2003, the project has 
created a repository of mutant ES cells, 
most of them conditionals, for nearly 18,000 
genes. Any researcher can order a cell line 
and spend a year or more making a needed 
knockout mouse. It has also bred 5011 mu- 
tant mouse strains that have germ line trans- 
mission of the knockout. This summer, as 
part of KOMP2, NIH decided to extend the 
tally of live knockouts to 8000, funding JAX, 
the University of California, Davis, and Bay- 
lor College of Medicine in Houston, Texas, to 
do the work. But it specified that the knock- 
outs should be made using CRISPR alone. 

Colin Fletcher, a mouse geneticist at
NIH’s National Human Genome
Research Institute in Rock- 
ville, Maryland, who oversees 
KOMP2, says advisers endorsed 
the switch to CRISPR. “You can’t 
cling to the old technology,”
Fletcher says. “A lot of people
have abandoned the ES cell
repository and, on the other
hand, a lot of people have come
into the field because of the
new technology. People are voting
with their feet. People are
putting much more effort into 
making conditional alleles with 
CRISPR rather than making 
ES cells.” 

Skarnes, who has just moved 
from Wellcome to the JAX ge- 
nomic medicine branch in 
Farmington, Connecticut, calls 
the shift premature. But he 
concedes that researchers will 
“eventually” figure out how to 
tweak CRISPR so that it makes 
conditional mutant mice with 
high efficiency. One route is to 
block an enzyme crucial to nonhomologous 
end joining. Another is to enhance a pro- 
tein critical to the HDR process that makes 
knockins possible. Still other investigators 
have toyed with lengthening the cell-cycle 
phase that is most favorable to that repair 
process, zapping zygotes with electric pulses 
to aid the entry of the CRISPR-Cas9 con- 
struct, and creating mutant Cas9s called 
“nickases” that only break a single DNA 
strand and preferentially induce HDR. 

Whatever CRISPR’s shortcomings appear 
to be at this point, Wiles emphasizes that its 
potential for engineering mice should not be 
underestimated. “There is a massive number 
of things CRISPR can do that people are just 
beginning to grasp,” he says. “We're really at 
the very, very early phases of development 
and the tool has infinite possibilities.”


Building bridges to regenerate axons

A secreted factor stimulates glia to mend spinal cord injury


By Philip R. Williams and Zhigang He 


After traumatic spinal cord injury, local
damage responses have an important
effect on regenerating axons. In
lower organisms (i.e., zebrafish and
newt), glial cells and other non-neuronal
cell types proliferate, migrate,
and differentiate to form a bridge between 
the two ends of a transected spinal cord. 
This glial bridge supports axon regenera- 
tion across the lesion site, enabling func- 
tional recovery. However, in mammals, a 
glial scar composed of mixed cell types and 
extracellular matrix forms to seal such a 
wound. Although early studies emphasized 
the inhibitory nature of this scar, recent 
studies have revealed that in mammals, as 
in lower organisms, it can also serve as a 
bridge to facilitate axon regeneration (1–3).
Yet, how this glial reaction is regulated re- 
mains largely unknown. On page 630 of 
this issue, Mokalled et al. (4) report that 
connective tissue growth factor a (CTGFa) 
is crucial for directing glial bridging and 
subsequent axon regeneration in a zebra- 
fish model of spinal cord injury. Although 
it remains to be determined whether this is
the case in mammals, this finding could inform
the design of neural repair strategies.

Tissue section of a zebrafish spinal cord reveals regeneration after a transection injury. Enhanced green fluorescent
protein (green) reports expression of CTGFa in tissue that is bridging the two severed ends. Glia are indicated in
red using an antibody against glial fibrillary acidic protein. Images were acquired using a Zeiss LSM 700 confocal
microscope. Shown is a maximum projection of images that were tiled and processed using ZEN imaging software.

Mokalled et al. examined whether the injured
spinal cord releases factors that coordinate
the glial bridging response. By analyzing
messenger RNA expressed in the injured
spinal cord of zebrafish, the authors identified
a few secreted proteins whose expression
in bridging glia and in ependymal cells (the
glial cells from which bridging glia develop)
was induced at 1 or 2 weeks after injury.
One such factor, CTGF, is a cytokine that is
conserved across species from zebrafish to
humans (5). Zebrafish express two isoforms,
CTGFa and CTGFb, whereas mammals express
only CTGF. This matricellular protein
is implicated in multiple processes such as
the formation and remodeling of blood vessels,
bone, and connective tissue. Notably, in
injury conditions, CTGF is a profibrotic factor
that stimulates connective tissue formation
and wound healing. Zebrafish carrying
a ctgfa mutation exhibited defects in glial cell
proliferation and bridge formation after spinal
cord injury. Consequently, axon regeneration
and functional recovery were impaired
in these mutants. Thus, CTGFa secretion
from glia is required for glial bridging and
self-repair. In wild-type zebrafish with spinal
cord injury, forced expression of CTGFa
increased glial bridging, axon regeneration,
and functional recovery (see the figure). A
soluble form of human CTGF protein applied
to the lesion site in zebrafish mimicked the
effect of zebrafish CTGFa in promoting glial
bridging and axon regeneration.

Several steps are needed to form a glial
bridge after spinal cord transection in fish
(6). A subset of glial cells must proliferate
locally, migrate into the lesion, and then
transform into bridging cells with a typical
elongated morphology (see the photo).
In CTGFa mutants, the most obvious defect
was the reduced proliferation of glial cells.
Consistently, the proliferation-promoting
carboxyl-terminal region of CTGFa, but not
the profibrotic amino-terminal of the protein,
could mimic the effects of full-length CTGFa.
Thus, it is possible that the proliferation of
pioneering glial cells is the limiting step and
CTGFa primarily acts at this point. It is also
possible that CTGFa has a direct role in other
steps such as the migration of glial cells and
their morphological transformation. Because
human CTGF interacts with various growth
factors and cytokines and modifies other signaling
pathways, it is possible that zebrafish
CTGFa enhances the function of other glial
bridge-promoting factors such as fibroblast
growth factor (6).

Forging the glial bridge

In a zebrafish model of spinal cord injury, regeneration and functional recovery require a secreted factor
that stimulates glial cells to form a bridge across the lesion.

Legend: progenitors, glial cells, axon, lesion.

Normal CTGFa: CTGFa promotes division of
epithelial cell progenitors that
give rise to bridging glia.

No CTGFa: Removing CTGFa reduces
progenitor proliferation, glial bridge
size, and axon regeneration.

CTGFa overexpression: CTGFa abundance promotes glial
bridging and functional recovery.

These results also spur speculation as to 
how observations in regeneration-competent 
zebrafish could be relevant to mammals. As 
with zebrafish, expression of CTGF is also 
induced in reactive astrocytes, invading fi- 
broblasts, and endothelial cells in rats after 
spinal cord lesion (7). Reactive astrocytes in 
the spinal cord lesion site in mammals, particularly
elongated astrocytes (8–10), could
be permissive for axon regeneration. 

With such similarities, why do injured 
axons fail to regenerate in mammals? An 
obvious reason is the injured axons’ poor in- 
trinsic regenerative ability in the adult mam- 
malian central nervous system. With recent 
advances in achieving axon regeneration by 
boosting the intrinsic regenerative ability 
of injured neurons (11), there is a better op- 
portunity to assess which cell types around 
the lesion are permissive and which are inhibitory.
For example, corticospinal axons in
the mouse can be induced to grow across a 
spinal cord injury site when the corticospi- 
nal neurons no longer express a gene called 
phosphatase and tensin homolog (Pten). In 
this scenario, corticospinal axons associate 
with reactive astrocytes while avoiding fibroblasts
and macrophages (10). These findings
are consistent with the notion that reactive 
astrocytes are permissive for axon outgrowth, 
and also indicate that fibroblasts and macro- 
phages are likely inhibitory. 

Also unclear is whether all reactive astro- 
cytes are permissive for axon regeneration. It 
appears that the morphology of astrocytes af- 
fects their ability to support regenerating ax- 
ons. In mammals, regenerating axons often 
associate with elongated bipolar astrocytes 
instead of stellate ones (8–10). It is unknown
whether different astrocytes reflect different
lineages or different functional states (immature
or mature) (12–15). In this regard,
a notable difference between zebrafish and 
mammals is the duration of CTGF expres- 
sion. CTGFa is transiently expressed in the 
zebrafish after spinal cord injury (4), whereas 
CTGF is persistently expressed in spinal 
cord-injured rats (7). In fibrotic lesions, per- 
sistent CTGF expression can drive the initial 
wound healing process into a sustained scar- 
ring response with a mixed cell composition 
and extracellular matrix accumulation (5). 
An analogous scar in spinal cord injury could 
result in a physical and biochemical barrier 
that blocks axon regeneration. It would be in- 
teresting to assess the effects of manipulating 
the timing of CTGF expression on glial bridge 
formation and axon regeneration in both ze- 
brafish and mammals. Furthermore, in hu- 
man patients, inflammatory reactions in the 
spinal cord lesion site lead to cavity forma- 
tion surrounded by a dense glial scar. In such 
a condition, a strategy for promoting glial 
bridges, such as manipulating CTGF expres- 
sion and other pathways that control scar for- 
mation (6, 8), could be useful for promoting 
axon regeneration and functional recovery in 
human patients.


A testing time for antimatter

Precision measurement of antiprotonic helium provides a test of physical laws


By Wim Ubachs 


Spectroscopy is the most accurate
branch of science. Optical transition 
frequencies in isolated atoms and mol- 
ecules can nowadays be measured to 
many-digit accuracies by applying the 
tools developed in the atomic phys- 
ics community: ultrastable lasers, locked via 
frequency-comb lasers to atomic clocks, and 
the techniques to cool and control the motion 
of atoms. Precision measurements on small 
quantum systems can be compared with 
theoretical descriptions of these systems at 
the most fundamental level, allowing physics 
theories to be tested and enabling the search 
for physics beyond the standard model (1).
On page 610 of this issue, Hori et al. (2) ap- 
ply these tricks of the trade to a small atomic 
quantum system with a built-in antiparticle 
to perform precise spectroscopic measure- 
ment in antiprotonic helium (see the figure). 
The technique of buffer-gas cooling is dem- 
onstrated for the first time on a composite 
matter-antimatter particle. 

The laws of physics conspire to create the 
conditions for producing relatively long-lived 
states in the exotic antiprotonic helium atom 
(ep̄He). In all other cases of antiprotons stopping
in matter, the characteristic annihilation
process by which a matter particle and
an antimatter particle end their lives in a
flash of energy takes place on a time scale of
less than a nanosecond. The Heisenberg uncertainty
principle then prohibits a precision
measurement. The capture of antiprotons 
in a helium atom, under replacement of an 
electron, is the exception (3). The inclusion 
of the antiproton produces a p̄He⁺ ionic core
in a heavy-Rydberg two-particle state. Energy
conservation dictates that p̄He⁺ emerges
in a quantum state n of typically n = 38; in 
the experiment by Hori et al., states with
n = 31 to n = 40 are probed. The overlap of
the wave functions of p̄ and He²⁺ is minimal
if the high-n Rydberg states exhibit a large
angular momentum, for example, in a state
such as n = 38 with ℓ = 36 or 37 (where ℓ denotes
the quantized angular momentum).
These circular states have typical lifetimes of
microseconds, long enough to perform a laser
experiment. Narrow spectral lines can be
measured to determine level energies to high
accuracy. The Rydberg constant of the heavy-Rydberg
p̄He⁺ ion pair is so large that the
transitions by which the n-quantum number
changes by unity lie in the wavelength range
of the near-infrared, visible, and ultraviolet,
where Hori et al. have narrowband and stable
lasers at hand.

Structure of the antiprotonic helium system (ep̄He).
The positive He²⁺ and negative p̄ ions can be considered
to be bound as a heavy-Rydberg ion (11). For n = 38, the
typical state in which the antiproton is captured, this
yields a characteristic distance of 0.16 Å. The electron is
then bound in the field of the two-particle ionic core of
charge +1, similar to the way it is bound in the hydrogen
atom at a typical distance of the Bohr radius a₀ = 0.5 Å.
The quantum-level structure of antiprotonic helium can
be calculated to 10-digit precision (6, 7).
Legend: proton, neutron.

The Antiproton Decelerator at CERN used in the
preparation of the antiprotonic helium atoms.
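
The typical capture state of n = 38 can be recovered from a simple hydrogen-like estimate, sketched here in Python. The sketch assumes that the antiproton ends up with about the same binding energy as the 1s electron it replaces; because hydrogen-like binding energies scale as the reduced mass divided by n², this gives n ≈ √(μ/mₑ), where μ is the reduced mass of the antiproton and the helium nucleus. The mass ratios are rounded reference values, the antiproton mass is taken equal to the proton mass, and the function names are illustrative.

```python
import math

# Masses in units of the electron mass (rounded reference values).
M_ANTIPROTON = 1836.15267  # assumed equal to the proton mass
M_HELIUM4_NUCLEUS = 7294.29954


def reduced_mass(m1: float, m2: float) -> float:
    """Reduced mass of a two-body system."""
    return m1 * m2 / (m1 + m2)


def capture_quantum_number(mu_in_electron_masses: float) -> float:
    """n at which a heavy particle matches the 1s electron binding energy.

    Hydrogen-like energies scale as mu / n**2, so equating the heavy
    particle's energy (mass mu, level n) with the electron's (mass 1, n = 1)
    gives n = sqrt(mu).
    """
    return math.sqrt(mu_in_electron_masses)


mu = reduced_mass(M_ANTIPROTON, M_HELIUM4_NUCLEUS)
n0 = capture_quantum_number(mu)
print(f"reduced mass = {mu:.1f} electron masses, n0 = {n0:.1f}")
```

This is only an order-of-magnitude argument; the 10-digit level energies mentioned in the text require full three-body QED calculations, not this estimate.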

The second electron of the helium atom 
remains in its 1s orbital, and it acts as a spec- 
tator during the capture process. It favorably 
shields the composite newly built electrically 
neutral ep̄He system during collisions with
the outside world. This property makes it 
possible for the antiprotonic helium atom to 
survive while it undergoes multiple collisions 
in the surrounding bath of cold helium atoms 
at a temperature of 1.5 K above absolute zero. 
This buffer-gas cooling technique, originally 
developed for the preparation of cold atoms 
and molecules (4), is applied to the exotic 
atom bearing an antiparticle in its structure. 
After a number of collisions, without annihi- 
lation, the exotic atoms are equilibrated and 
take up the temperature of the bath. At 1.5 K, 
the velocity of the particles is very low, and 
hence the usual spectral line broadening due 
to the Doppler effect is suppressed, allowing 
for the measurement of narrow spectral lines. 
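
The benefit of cooling can be illustrated with the standard expression for the Doppler width of a spectral line from a gas in thermal equilibrium, Δν/ν = √(8 ln 2 · k_B T / (m c²)). The Python sketch below is a rough illustration only: it assumes a mass of about 5 atomic mass units for the exotic atom (a helium-4 nucleus plus an antiproton, neglecting the electron), and the function name is hypothetical.

```python
import math

K_B = 1.380649e-23       # Boltzmann constant, J/K
AMU = 1.66053906660e-27  # atomic mass constant, kg
C = 299_792_458.0        # speed of light, m/s


def doppler_fwhm_fraction(temperature_k: float, mass_amu: float) -> float:
    """Fractional Doppler width (FWHM divided by line frequency) of a thermal gas."""
    rest_energy = mass_amu * AMU * C**2
    return math.sqrt(8.0 * math.log(2.0) * K_B * temperature_k / rest_energy)


ATOM_MASS_AMU = 5.0  # assumed: helium-4 nucleus plus antiproton

cold = doppler_fwhm_fraction(1.5, ATOM_MASS_AMU)
warm = doppler_fwhm_fraction(300.0, ATOM_MASS_AMU)
print(f"1.5 K: {cold:.2e}   300 K: {warm:.2e}   ratio: {warm / cold:.1f}")
```

Because the width scales with the square root of temperature, cooling from room temperature to 1.5 K narrows the Doppler profile by a factor of about 14 under these assumptions.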

The precision frequency measurements 
performed by Hori et al. on 13 energy-level 
transitions in antiprotonic helium, and ob- 
tained with nine-digit precision, can be in- 
terpreted in three ways. First, it is a test of 
charge-parity-time (CPT) invariance, argu- 
ably the most fundamental theorem on sym- 
metries in physics. All calculations in particle 
physics would lose validity if CPT symmetry 
were not valid (3). The exchange of particles 
by their antiparticles (C), reflection in a mir- 
ror (P), and letting time run backwards (T) 
should not cause a change in the measurable 
properties of a physical system. The measure- 
ments verify that CPT is a valid symmetry, at 
least at the level of accuracy now obtained. 

Second, antiprotonic helium is a three- 
particle Coulomb system, the stability of 
which was already investigated by Poincaré 
in the framework of classical mechanics. In 
the fully quantized version of electromagnetism,
quantum electrodynamics (QED), the
stability of the helium atom (5) and the H₂⁺
molecular ion (6) is well established, and the
bound-state level energies can be calculated
to 10-digit precision. The same is true for the
ep̄He system (6, 7), which is found to withstand
a test of QED.

Third, the transition frequencies sensitively 
depend on the mass ratios of the constituent 
particles. Similarly, as in a spectroscopic de- 
termination of the proton-electron mass ratio 
from HD⁺ spectroscopy (8), evaluation of the
data of Hori et al. yields a value for the an- 
tiproton-electron mass ratio. We learn that 
antiprotons weigh the same as protons, up to 
the 10th digit. 

The frontier of particle physics is com- 
monly approached in the realm of high- 
energy physics, using particle accelerators 
like the Large Hadron Collider (LHC). The 
detection of the Higgs boson at the LHC 
marks the culmination of the standard 
model of physics, but the quest is on to ex- 
plore new physics beyond that. Alternative 
approaches exist in the low-energy domain 
by performing extreme precision measure- 
ments on small atomic and molecular sys- 
tems for which the quantum-level structure 
is calculable (1). This can be done, for example,
through the search for an electric
dipole moment of the electron in molecules 
that might reveal signatures of supersym- 
metry (9). Laser precision measurements of 
optical transitions in molecules constrain 
the existence of higher dimensions beyond 
the 3 + 1 (spatial and time) that we regu- 
larly observe, or the occurrence of a fifth 
force beyond the three forces known in the 
standard model plus gravity (1). Previous
measurements on antiprotonic helium have 
already set limits on the strength of such 
a hypothetical fifth force at sub-ångström
length scale (10); the present data constrain 
these phenomena even further. 

This exotic atom involving antimatter 
seems a fortunate accident of nature. It ex- 
hibits long-lived (metastable) quantum states 
that can be probed with lasers, and it survives 
the collisional conditions needed to cool its 
kinetic motion, as Hori et al. have demon- 
strated. It is likely that these properties may 
be further exploited to reveal new physics 
in future experiments on this extraordinary 
atom-like particle.


CONSERVATION


Migratory birds under threat 


Habitat degradation and loss, illegal killings, and climate 
change threaten European migratory bird populations 


By Franz Bairlein 


The populations of migratory bird species
that breed in Europe and overwinter
in sub-Saharan Africa are declining
considerably faster than those of nonmigratory
resident species or of migratory
species that overwinter in Europe
(1). Likely factors are habitat changes due to 
changes in land use, illegal killing and tak- 
ing along the northern African coasts, and 
climate-induced changes in timing of migra- 
tion and breeding. However, not only Euro- 
pean trans-Saharan migrants are declining 
fast. This holds also for North American long- 
distance migrants wintering in Central and 
South America. To halt these declines, preser- 
vation of remaining habitats and restoration 
of habitats both at breeding and nonbreed- 
ing grounds is essential, as well as stopping 
illegal killing and taking of birds along their 
migration routes. 


ILLEGAL KILLING AND TAKING 

Every year, between 11 million and 36 mil- 
lion birds are killed or taken illegally in the 
Mediterranean region (2). The areas of great- 
est concern are in the eastern and central 
Mediterranean, with more than 5 million 
birds taken in both Egypt and Italy and an 
estimated 1 million each in Cyprus and Leba- 
non. Common migratory species such as Eur- 
asian chaffinch, blackcap, and song thrush 
are most affected, but many less common mi- 
gratory species are also taken in substantial 
numbers, including species of global conser- 
vation concern such as red kite and Eurasian 
curlew. On average, the annual illegal killings 
and takings of threatened or near-threatened 
migratory bird species amount to 1.0 to 3.5% 
of their populations (2)—percentages that are 
very likely to have considerable impacts on 
the fate of these species. 

Illegal trapping can cause a collapse in 
population numbers within a short period 
of time. For example, the yellow-breasted 
bunting was abundant in its Eurasian 
breeding range until illegal takings in China 
caused an 84% population decline between 
1980 and 2013 (3). Similarly, the passenger 
pigeon was once the most abundant migratory
bird in North America, numbering
around 3 billion to 5 billion birds in the
early 19th century. Massive-scale hunting as 
cheap meat resulted in its extinction at the 
turn of the 20th century (4). 
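
For a sense of scale, the 84% decline of the yellow-breasted bunting between 1980 and 2013 can be converted into an average yearly rate. The short Python sketch below assumes a constant proportional decline every year, which real populations rarely follow; the function name is illustrative.

```python
def annual_decline_rate(total_decline: float, years: int) -> float:
    """Constant yearly fractional decline that compounds to total_decline over the given years."""
    if not 0.0 <= total_decline < 1.0:
        raise ValueError("total_decline must be in [0, 1)")
    if years <= 0:
        raise ValueError("years must be positive")
    remaining = 1.0 - total_decline
    return 1.0 - remaining ** (1.0 / years)


# 84% decline between 1980 and 2013, as reported in the text.
rate = annual_decline_rate(0.84, 2013 - 1980)
print(f"average decline: {rate:.1%} per year")
```

Under this assumption, a loss of a little over 5% per year is enough to remove more than four-fifths of a population in about three decades.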


HABITAT DEGRADATION AND LOSS 

Many European migratory birds that over- 
winter in sub-Saharan Africa do not travel 
across those areas of the eastern Mediterra- 
nean most affected by illegal killings. Rather, 
they use a flyway across the western Mediter- 
ranean where illegal taking is much less in- 
tense (see the figure). Other factors must play 
a role in their decline. 

A large number of European migratory 
species overwinter in the dry savannas of 
sub-Saharan Africa. Annual survival of many 
of these species correlates with rainfall in the 
Sahel zone (1). However, despite an increase
in rainfall in the Sahel in recent decades, bird 
populations have continued to decline. Thus, 
factors other than rainfall must contribute to 
the population declines. Land-use and land- 
cover changes are the most important (5). Be- 
tween 1975 and 2000, agriculture increased 
by 57% in sub-Saharan Africa at the expense 
of natural vegetation, with nearly 5 million 
hectares of forest and nonforest vegetation 
lost per year (6). Most affected are the Sahel 
and Guinea Savanna zones where the major- 
ity of the Eurasian migratory species over- 
winter. An analysis of breeding-bird survey 
trends of 26 long-distance migratory species 
in the United Kingdom shows that wintering 
habitat is the most important determinant 
of population trends, with specialist species 
that occupy either open or woodland habitats 
in Africa showing declines (7). 


CLIMATE CHANGE 

Climate change is another major driver for 
biodiversity changes, including responses of 
bird populations (8). Many migratory species, 
including those that overwinter in sub-Saha- 
ran Africa, now arrive earlier at their spring 
breeding grounds (9). However, different or- 
ganisms do not respond to climate change at 
the same pace, which has led to an ecologi- 
cal mismatch between some consumers and 
their prey (10). 

Dutch pied flycatchers, which overwin- 
ter in sub-Saharan Africa, do not arrive 
earlier at breeding grounds, but the popu- 
lations of their insect food peak earlier as 
a result of warmer spring temperatures.
This mismatch between breeding and food
availability has caused a decline of up to 
90% in Dutch pied flycatcher population 
sizes (11). 

Similarly, an analysis of 242 time se- 
ries over the past five decades for 117 Eu- 
ropean migratory bird species revealed a 
larger population decline for species with a 
larger mismatch between food availability 
and consumer requirement (12). Climate
change-induced changes in migratory bird 
populations are thus evident, but their rel- 
ative contribution to population changes 
in long-distance migrants wintering re- 
quires further clarification. 


Threats to European-African migrants

Bird populations are in steep decline despite not migrating across
the blackspots of illegal killing. Habitat degradation and loss are likely the
most important causes, but climate change also affects populations.

Climate change (pied flycatcher): Earlier arrival at breeding
grounds causes ecological mismatch between birds
and their insect prey.

Habitat degradation and loss: Increase in agriculture
at the expense of natural vegetation affects birds
at breeding sites, winter habitats, and along
migratory routes.

Illegal killing: Concentrated in the
eastern Mediterranean


A GLOBAL ISSUE

Although I have focused on the European-African
migration system, many migratory bird
species are declining around the world. In
North America, the Breeding Bird Survey (a
roadside census throughout the United States
and parts of Canada that has been running
since 1966) shows that half of the migratory
bird species are declining (13); declines in
long-distance Neotropical migrants are more
pronounced than those of birds that migrate
short distances. As for the European-African
migrants, climate mismatch is a factor, but
the long-distance migrants are particularly
sensitive to habitat changes (14). Similar patterns
might be expected for migratory birds
in East Asia, but large-scale and long-term
breeding bird population surveys, population
trends, and nonbreeding habitat assessments
are largely missing for that region.


OUTLOOK

Any attempt to understand and ameliorate
migratory bird losses must consider threats
far away from their breeding sites. These
threats could include killing and taking,
human disturbance at staging sites, pesticide
exposure, or collisions with human
obstacles such as wind turbines and traffic.
The most important drivers of population
declines in migratory bird species are likely
to be land-use changes and connected habitat
degradation and loss, but few studies have
investigated the specific impacts of land-use
and land-cover changes on migratory bird
populations in detail (7). Such studies are
urgently needed to disentangle the various
factors acting on populations of migratory
species.

These studies must include carry-over ef- 
fects, because conditions at nonbreeding 
grounds can affect reproductive success in the 
breeding season (15). Conversely, conditions
at breeding grounds can affect nonbreeding 
fitness. In the red knot, climate-induced mal- 
nutrition at their Arctic breeding grounds 
resulted in shorter bills and reduced survival 
rates at their African wintering grounds (16). 

Future studies must also consider the role 
of stopover sites. Most long-distance migratory
species rely on considerable fueling at
such stopover sites before they embark on
migratory flight, particularly if they need to 
cross oceans or deserts with no or limited 
feeding opportunities. The main fuel for mi- 
gratory flights is fat. Consequently, migrants 
must accumulate enormous amounts of 
fat before migratory flights, some doubling 
their body mass within just a few weeks. To 
achieve that timed fueling, suitable habitats 
and food must be available at stopover sites. 
The effect of habitat loss and degradation at 
stopover sites on population trends in trans- 
Saharan migrants remains uninvestigated. 

Consequently, future studies also need to 
clarify where species of conservation concern 
stop over and overwinter, and which migra- 
tion routes they take. For example, British 
common cuckoos migrate along two routes to 
the same winter destination in sub-Saharan 
Africa. They face a much higher en route mor- 
tality when migrating along a western route 
than along an easterly route (17). Emerging
technologies for tracking individual migratory
birds throughout their annual cycle (18)
will reveal migratory routes and destinations 
in more detail than past bird marking, allow- 
ing more detailed and frequent assessment of 
the drivers of migratory species declines. 

Existing data can, however, already be 
translated into immediate conservation ac- 
tions to halt the decline of these migratory 
species. Wetlands can be protected from 
drainage; woody vegetation can be protected 
from grazing or even be replanted. Such ef- 
forts would not only support migratory birds 
but also the local biodiversity and livelihoods 
of local farmers and pastoralists. In addition, 
illegal taking and killing can be stopped. The 
required political instruments, such as the 
African-Eurasian Waterbird Agreement and 
the African-Eurasian Migratory Landbird 
Action Plan, are already in place. We just 
need to act, and we can if we wish.


CANCER


How tobacco smoke changes the (epi)genome


Distinct mutation types are found in diverse cancers associated with smoking 


By Gerd P. Pfeifer 


Worldwide, more than 1 billion
people are tobacco smokers. Cigarette
smoking drastically increases
the risk of lung cancer. However,
many other cancer types also occur
more frequently in smokers
than in nonsmokers, including
cancers of the oral cavity, larynx, 
pharynx, esophagus, liver, cervix, 
pancreas, bladder, and kidney. Ge- 
nome sequencing efforts are begin- 
ning to provide more sophisticated 
clues as to the processes at work 
that are shaping the mutational 
landscape of tumors. On page 618 
of this issue, Alexandrov et al. (1) 
focused specifically on smoking- 
associated cancers and dissected 
genetic and epigenetic differences 
in tumors between smokers and 
lifetime nonsmokers. 

The specific molecular origins of most hu- 
man cancers are unknown. By examining the 
variety of genetic and epigenetic alterations 
found in tumors, a process described as mo- 
lecular archaeology can help to reconstruct 
the events that have taken place in the past 
and have led to the evolution of tumor genomes.
This field had its beginnings more
than two decades ago when mutations in the 
TP53 tumor suppressor gene were found to 
be common but to differ quite substantially 
in their nature between different tumor types 
(2). Today, whole-genome or -exome sequenc- 
ing data have generally confirmed tumor 
type-specific mutations similar to those 
found earlier in TP53 (3). 

Alexandrov et al. examined 13 different 
cancer types that have been linked to in- 
creased risk in smokers and analyzed more 
than 5000 tumor genomes or exomes. They 
had previously developed an algorithm based 
on the six different types of possible base 
substitutions (C>A, C>G, C>T, T>A, T>C, 
T>G) and the 16 types of nearest neighboring 
bases, resulting in 96 specific trinucleotide 
substitutions (4). These substitutions occur 
in characteristic combinations in different tu- 
mor types and were assigned to ~30 so-called 
mutational signatures. Each type of tumor 
has one or several such signatures common 
among patients. Beginning with lung cancer, the authors provided
comprehensive comparisons for lung adenocarcinomas in smokers and
nonsmokers. This type of lung tumor is also seen in nonsmokers, unlike
small cell and squamous cell lung cancers, which almost always develop
only in smokers. Smokers’ adenocarcinomas stood out by a dramatic
increase of “signature 4” mutations, defined as C>A/G>T base
substitutions, which were almost absent in nonsmokers. The same
signature 4 mutations have previously been implicated in an in vitro
experiment with benzo[a]pyrene (5), a prominent member of the class of
polycyclic aromatic hydrocarbons found in the tar fraction of cigarette
smoke. In addition, benzo[a]pyrene forms strong DNA damage at
tobacco-specific mutation hotspots, mechanistically linking polycyclic
aromatic hydrocarbons to lung cancer mutations (6). Because signature 4
mutations were most prominent in tissues directly exposed to tobacco
smoke (the lung and larynx), it is plausible that the signature is
derived from DNA damage at guanine, which is then incorrectly bypassed
by DNA polymerases, leading to the observed mutations. Signatures 4 and
5 were enriched in clonal mutations (they were found in all cells) of
smokers, suggesting that they are due to cigarette smoke exposure before
clonal expansion and tumor progression.

Smoking increases the risk of developing several types of cancer.
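The count of 96 trinucleotide substitution classes described above follows from simple enumeration: six substitution types, each flanked by one of four 5' bases and one of four 3' bases. The sketch below is only an illustration of that bookkeeping, not the algorithm of Alexandrov et al.; the label format (for example `A[C>A]G`) and the function names are assumptions chosen for this example.

```python
from itertools import product

BASES = "ACGT"
# The six substitution types, written from the pyrimidine (C or T) of the base pair.
SUBSTITUTIONS = ["C>A", "C>G", "C>T", "T>A", "T>C", "T>G"]
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def mutation_classes():
    """List every 5'-base[ref>alt]3'-base label (hypothetical label format)."""
    return [
        f"{five}[{sub}]{three}"
        for sub in SUBSTITUTIONS
        for five, three in product(BASES, repeat=2)
    ]


def classify(five, ref, alt, three):
    """Map a substitution seen on either strand to its pyrimidine-centered class."""
    if ref in "AG":
        # A purine reference base is reported on the opposite strand:
        # complement each base and swap the flanking positions.
        five, ref, alt, three = (
            three.translate(COMPLEMENT),
            ref.translate(COMPLEMENT),
            alt.translate(COMPLEMENT),
            five.translate(COMPLEMENT),
        )
    return f"{five}[{ref}>{alt}]{three}"


if __name__ == "__main__":
    print(len(mutation_classes()))       # number of distinct classes
    print(classify("A", "G", "T", "C"))  # a G>T change reported as C>A
```

Folding purine-centered changes onto the opposite strand is why a signature such as signature 4 can be written as C>A/G>T: both describe the same base-pair change.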

Importantly, when the odds ratios 
for cancer in smokers versus non- 
smokers are highest (ranging from 
over 100 in small cell and squamous 
cell lung cancers to ~22 in lung ad- 
enocarcinomas and ~13 in laryngeal 
cancers), the contribution of signa- 
ture 4 to mutations in smokers is the 
greatest (see the figure). However, as 
the odds ratio becomes less than 5, 
such as for oral cavity or esophageal 
cancers, signature 4 is no longer en- 
riched in smokers. Thus, signature 4 is chiefly 
responsible for an increased mutational load 
and for the most strongly enhanced cancer 
risk in smokers. 

Another mutational signature found enriched in several
tobacco-associated cancers is signature 5, which is less distinct than
signature 4 and contains several types of base substitutions (1, 4). The
molecular origin of this signature remains unknown. Although not
experimentally proven, signature 4 might also be produced by aromatic
amines, another class of chemicals found in tobacco smoke that has been
linked to bladder cancer. However, the absence of signature 4 in bladder
cancer might suggest that aromatic amines are not directly involved in
this process or that the mutations caused were too few to be detected.
Instead, Alexandrov et al. see a strong enrichment of signatures 2 and
13 in bladder tumors, which are characterized by C to T or C to G
mutations at TpC dinucleotides. These signatures have been linked to
overactivity of the APOBEC (apolipoprotein B editing catalytic
polypeptide) DNA editing enzymes (7). Signatures 2 and 13 are also
present in lung adenocarcinomas, where they seem to be enhanced by
smoking. In fact, in 4 of 50 lung adenocarcinomas of both smokers and
nonsmokers, signatures 2 and 13 accounted for more than 70% of the
mutations (1), raising questions about a possible unique etiology or
biology of these individual tumors.

More reasons to quit
Enrichment of a specific mutational signature in cancers of smokers is
strongly linked to the odds ratio of developing cancer.

Besides mutations, tumor DNA also retains 
information regarding the modified base 
5-methylcytosine at CpG sequences. Alexan- 
drov et al. compared CpG methylation data 
for smokers and nonsmokers but did not find 
substantial differences in methylation (1).
This finding is inconsistent with the com- 
mon belief that environmental exposures, 
including smoking, have a major effect on 
the DNA methylation machinery. However, 
for lung adenocarcinoma, they did find that 
a small fraction (<0.1%) of the queried CpG 
sites were differentially methylated in smok- 
ers with a tendency for hypomethylation. It 
will be interesting to see whether these hy- 
pomethylated CpG sites coincide with sites of 
intense DNA damage by tobacco carcinogens. 

The new study presents the most com- 
prehensive view to date on smoking-related 
genetic and epigenetic differences in hu- 
man tumors. It provides strong support 
for a prominent role of tobacco carcino- 
gens in cancer of tissues directly exposed 
to smoke but raises intriguing questions 
regarding why more “remote” tissues show 
an enhanced cancer risk in smokers in the 
absence of a well-defined mutational speci- 
ficity. Although smaller-scale mutational 
events are still possible, more indirect ef- 
fects of smoking should be considered. 
These might include other epigenetic 
events, immune system-related events such 
as inflammation, and/or metabolic distur- 
bances caused by tobacco smoking. 
Deep-sea corals feel the flow


Uranium isotope data from deep-water corals help to 
resolve conflicting results from shallow-water corals 


By Yusuke Yokoyama and Tezer M. Esat


Shallow-water corals provide the only direct way of determining the
absolute timing and magnitude of sea levels over the past 600,000 years.
Their uranium and thorium abundances and uranium isotope ratios,
combined with coral-reef elevations, help to determine past sea levels
(1). The veracity of the dating results depends on the state of
preservation of fossil corals over extended time periods. Models have
been developed to correct for specific complications. Deep-sea corals
(see the photo) do not provide sea-level information. However, as Chen
et al. (2) show on page 626 of this issue, they appear to be responsive
to ocean circulation changes and continental riverine and ice-sheet
meltwater inputs to the oceans. In addition, the data help to resolve
one of the most contentious issues in uranium series dating, namely
whether systematic variations in ²³⁴U are due to physical ocean
processes or are artifacts of poor preservation.

Corals take up uranium from the oceans, where it exists as a trace
element, and concentrate it by a factor of 1000. The residence time of
uranium in the oceans is over half a million years. In its source rocks,
²³⁸U decays via the short-lived ²³⁴Th to ²³⁴U. This radioactive decay
process damages the mineral locally, enabling eroding water to
preferentially leach ²³⁴U. Furthermore, ²³⁴U can recoil out into pore
spaces, further assisting its preferential leaching. Thus, rivers carry
large excesses of ²³⁴U, and the oceans are enriched by 15%. Uranium is
highly soluble whereas thorium tends to amalgamate with insoluble
particles; therefore, corals take up uranium but almost no thorium. The
further decay of ²³⁴U to ²³⁰Th provides an age for the coral, and the
ocean ²³⁴U levels can be back-calculated to when the coral was alive.
For this dating method to work, however, a key requirement is that the
sample remain closed to external addition or removal of uranium or
thorium; it does not work if the system is “open” due to exchange of U
and Th between samples and their surrounding environments.

Historically, efforts to calculate coral ages with the uranium-thorium
method required the back-calculated ²³⁴U/²³⁸U ratio to equal the present
oceanic value; deviations from this value were interpreted as an
indication of external influences and taken to mean that the age was
unreliable (3). As data accumulated, scientists focused on the last
interglacial, ~130,000 years ago, to determine the reliability of the
dating method. Coral samples from the last interglacial are easy to
collect as they are above the present sea level. The data showed that
the last interglacial extended from ~130,000 to 120,000 years ago.
However, a substantial number of corals from the last interglacial had
higher ²³⁴U/²³⁸U values than present-day corals (3-5); the ²³⁴U/²³⁸U
values appeared to increase almost linearly with age. To explain these
observations, Thomson and Goldstein proposed that circulating fluids
containing ²³⁴Th and ²³⁰Th had contaminated some of the samples (6). The
age-²³⁴U trend was used to correct for high ²³⁴U/²³⁸U by projecting it
to the current ²³⁴U value and to younger ages and was called an
open-system age.
This deep-sea reef is formed by the coral species
Lophelia pertusa at 450 m below the surface of the 
Gulf of Mexico, with a brisingid starfish in the center. 
As Chen et al. show, data from deep-sea corals can 
help to explain past variations in ocean uranium levels, 
thus putting uranium dating of shallow-water corals 
on firmer footing. 


There are good reasons for opposing this 
model. However, most scientists now accept 
that during the last ice age, the ²³⁴U/²³⁸U
ratio of the ocean was lower than it is 
today and that it gradually increased to the 
present value over the past 20,000 years (7, 
8). Applying the model to the most precise 
single data set yielded a start date for the 
last interglacial at 125,000 years ago (6), 
rather than 130,000 years ago as previously 
estimated. This discrepancy put the utility 
of uranium-series dating in doubt. Is the 
timeline of climate and sea-level variations 
over the past 600,000 years derived from 
this dating reliable? 

An alternative explanation for high ²³⁴U is that there were transient
inputs of additional ²³⁴U to the oceans at times of rapid climate and
sea-level changes, but this explanation is not generally accepted (9).
One of the main reasons for this lack of acceptance is the difficulty of
intuitively understanding the trends implied by the uranium-thorium
equations. For example, it is generally held that older coral reefs have
high current ²³⁴U contents, yet exactly the opposite is true: They have
low ²³⁴U/²³⁸U ratios. The high ratios are the back-calculated ocean
values at the time when the coral was growing.
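The counterintuitive point above, that an old coral has a low measured ²³⁴U excess even when the back-calculated ocean value is high, can be illustrated with a simple closed-system decay calculation. This is a minimal sketch under stated assumptions, not the authors' procedure: the ²³⁴U half-life is taken as roughly 245,000 years, the decay of ²³⁸U is neglected over these time scales, the excess is expressed as a fraction above secular equilibrium (0.15 for the 15% ocean enrichment quoted earlier), and the function names are hypothetical.

```python
import math

# Assumption: approximate half-life of 234U in years.
HALF_LIFE_U234_YEARS = 245_000.0
LAMBDA_234 = math.log(2) / HALF_LIFE_U234_YEARS


def measured_excess(initial_excess, age_years):
    """234U excess remaining today in a closed-system coral of the given age."""
    return initial_excess * math.exp(-LAMBDA_234 * age_years)


def back_calculated_excess(excess_today, age_years):
    """Ocean 234U excess at the time of coral growth, from today's measurement."""
    return excess_today * math.exp(LAMBDA_234 * age_years)


if __name__ == "__main__":
    for age in (10_000, 125_000):
        today = measured_excess(0.15, age)
        print(age, round(today, 4), round(back_calculated_excess(today, age), 4))
```

A measured excess that back-calculates to more than the modern ocean value therefore signals either open-system alteration or a genuinely higher ocean ²³⁴U level when the coral grew, which is the distinction the deep-sea coral data address.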

This is why Chen et al.’s study of deep-sea corals is important. The
authors use the solitary corals as time markers of past ocean ²³⁴U
levels at depths from 400 to 2000 m. First, they confirm the low ²³⁴U
values during the last ice age. Second, their data show a transient peak
in ²³⁴U at the end of one of the periodic stop/start episodes of the
Atlantic overturning circulation at 15,000 years ago. One could argue
that the excess ²³⁴U is due to open-system behavior, entailing random
external addition of ²³⁴Th (which rapidly decays to ²³⁴U) and ²³⁰Th.
However, the systematics of the data and the quality of the samples do
not support this conclusion.

In addition, the samples used by Chen 
et al. are from solitary corals that grow 
individually and are not embedded in a 
reef. It is highly unlikely that they could be 
affected by circulating fluids enriched with 
Th isotopes. There is simply not enough Th 
in the water column to cause the observed 
effects, even over the course of 100,000 years. 
Therefore, the transient high-²³⁴U levels
are intrinsic to hydrospheric processes and 
not due to postmortem alteration of coral 
U/Th trace-element abundances. The excess 
²³⁴U was circulated from the surface to the
deeper ocean following the resumption of 
the Atlantic current. It probably included 
nutrients, promoting a spurt in deep-sea 
coral growth. 

The present demonstration that transient 
ocean ²³⁴U excesses could have caused the
observed high coral ²³⁴U values casts doubt
on the veracity of the open-system model.
If the ocean sustained transient high-²³⁴U
episodes during the last deglaciation, the 
same is likely to have happened during the 
penultimate deglaciation. A recent analysis 
of diverse evidence for the timing of the 
last interglacial favors the 130,000 year 
start date over the open-system estimate of 
125,000 years (10).

Chen et al. postulate that the source of 
the excess ²³⁴U was the subglacial basal melt,
which was in contact with frozen sediments
laden with excess ²³⁴U (11). This is a likely
scenario, but other evidence points to episodic 
sea-level rise as a probable mechanism. A 
large fraction (>80%) of uranium supplied by 
rivers is sequestered in nearshore sediments, 
salt marshes and mangroves, estuaries, and 
continental margins during periods of warm 
climate and high sea levels. These nearshore 
areas are exposed during periods of low 
sea level, resulting in rapid oxidation of 
uranium into highly soluble phases. The 
subsequent release of ²³⁴U-enriched uranium
into the oceans occurs over a sustained 
period as sea levels rise and can be taken 
up by corals growing close to coastal areas 
before the excess uranium is dispersed into 
the wider ocean. 

There is abundant evidence for a 
connection between ²³⁴U and sea level at
locations far from polar ice sheets, including 
at Huon Peninsula, Papua New Guinea (9). 
In contrast, it is surprising that the low- 
latitude Pacific deep-sea corals do not seem 
to have been as affected given their proximity 
to Antarctica (12). Future studies will better
delineate the sources and scope of external 
transient inputs of ²³⁴U into the oceans as
indicators of meltwater pulses and rapid sea- 
level changes, as opposed to stochastic open- 
system artifacts in coral. 
BIOTECHNOLOGY


Tips for battling billion-dollar beetles 


Tiny insecticidal proteins (Tips) kill pests that evolved resistance to engineered corn 


By Bruce E. Tabashnik 


In the never-ending war on insect pests, the widespread soil bacterium
Bacillus thuringiensis (Bt) is one of the greatest heroes. Insecticidal
crystalline (Cry) proteins and vegetative insecticidal proteins (Vips)
from Bt are treasured for their effectiveness against some devastating
pests and their safety for beneficial insects, wildlife, and people (1).
Sprays containing Bt proteins have been used for more than 70 years and
remain valuable in organic and conventional agriculture, forestry, and
vector control (1). Crops genetically engineered to produce Bt proteins
were introduced 20 years ago and quickly became a cornerstone of pest
management. They have suppressed pest populations, reduced reliance on
insecticide sprays, enhanced control by natural enemies, and increased
farmer profits (2-4). In 2015, Bt crops were planted on 84 million
hectares globally (5), including 81% of the corn and 84% of the cotton
in the United States (6). But with increasingly rapid evolution of pest
resistance to Bt crops (7, 8) (see the figure), alternatives are
urgently needed. On page 634 of this issue, Schellenberger et al. (9)
report the discovery of tiny insecticidal proteins (Tips) from other
soil bacteria that could be part of the solution.

The Tips found by Schellenberger et al. kill one of the most destructive
and adaptable crop pests in North America and Europe, a beetle called
the western corn rootworm (Diabrotica virgifera virgifera). The larvae
of this pernicious pest and closely related species devour corn roots,
costing growers more than $1 billion each year (10). The advance by
Schellenberger et al. is timely because evolution of resistance has
undermined control of rootworms by Bt corn (11-15).

The first transgenic corn targeting rootworms produces Bt toxin Cry3Bb
and was commercialized in the United States in 2003. Field-evolved
rootworm resistance to Cry3Bb was initially detected in Iowa in 2009 and
subsequently in several other midwestern states (11, 12). Four Bt toxins
active against rootworms are available now in Bt corn: Cry3Bb, mCry3A,
eCry3.1Ab, and Cry34/35Ab. However, resistance to Cry3Bb causes
cross-resistance to its close relatives mCry3A and eCry3.1Ab (13, 15),
and resistance to Cry34/35Ab was detected in Iowa in 2013 (14). Factors
hastening evolution of rootworm resistance to Bt corn include the
moderate efficacy of the toxins and growing corn that produces the same
Bt toxin year after year (11, 12).

The U.S. Environmental Protection Agency requires farmers to plant corn
that does not produce Bt toxins near Bt corn to promote survival of
susceptible insects and thereby delay evolution of resistance (11, 12).
However, the current regulations have been challenged as insufficient to
achieve this goal because they allow up to 95% Bt corn on each farm
(11). Moreover, adaptation to Bt corn plants making either a Cry3 toxin
or Cry34/35Ab will accelerate evolution of resistance to recently
introduced Bt corn plants that produce a Cry3 toxin in combination with
Cry34/35Ab (8). Therefore, controls effective against rootworms
resistant to the currently available Bt toxins are especially
significant.

The proteins identified by Schellenberger et al. may fit the bill. By
systematically screening soil for microbes other than Bt, they
discovered that the bacterium Pseudomonas chlororaphis has a potent
inhibitory effect on rootworms. They isolated an insecticidal protein
from P. chlororaphis and determined its amino acid sequence. They named
it IPD072Aa and matched it to a protein of 86 amino acids predicted from
a gene in the P. chlororaphis genome. From various databases, they
identified 18 proteins with 38 to 83% sequence identity to IPD072Aa:
four from P. chlororaphis, eight from six other species of Pseudomonas,
three from the switchgrass rhizosphere microbial community metagenome,
and one each from the bacteria Burkholderia, Xenorhabdus, and
Photorhabdus. Four of these homologs were toxic when fed to corn
rootworms in artificial diet. IPD072Aa and its homologs are dubbed here
as Tips because they are much smaller than previously described
insecticidal proteins from bacteria.

Schellenberger et al. genetically engineered corn plants to make
IPD072Aa. In greenhouse and field trials, the protection from rootworm
injury provided by these transgenic plants was similar to that from Bt
corn producing Cry34/35Ab. In a field trial in Iowa during 2014,
rootworm injury to corn was much higher for plants producing Cry3Bb than
for those making IPD072Aa. The extensive damage to corn producing Cry3Bb
implies rootworm resistance to that toxin. The efficacy of IPD072Aa in
this field trial suggests its utility against Cry3Bb-resistant
rootworms. Laboratory bioassays show that IPD072Aa killed
laboratory-selected strains of rootworm resistant to mCry3A or
Cry34/35Ab.

[Figure: years to resistance]
Pests evolving resistance faster to engineered Bt crops. In the 10 cases
of field-evolved practical resistance to Bt crops in five pest species,
the time from the first commercial planting of the Bt crop to the first
evidence of resistance decreased over the past two decades (r² = 0.60,
df = 8, P < 0.01) (7, 8, 11-15). Western corn rootworm resistance to the
four rootworm-active Bt proteins made by transgenic corn includes
cross-resistance to eCry3.1Ab detected in the field before plants
producing this toxin were commercialized.
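The statistics quoted in the figure caption (r² = 0.60, df = 8, P < 0.01) can be checked for internal consistency with the standard t test for a correlation coefficient. The sketch below uses only those reported values, not the underlying data, which are not given here; the function name is hypothetical, and the two-tailed critical value of about 3.355 for df = 8 at the 0.01 level is taken from standard t tables.

```python
import math

# Two-tailed critical t value for alpha = 0.01 with 8 degrees of freedom
# (standard t-table value, rounded).
T_CRITICAL_01_DF8 = 3.355


def t_from_r_squared(r_squared, df):
    """t statistic for testing zero correlation, given r^2 and df = n - 2."""
    return math.sqrt(r_squared * df / (1.0 - r_squared))


if __name__ == "__main__":
    t = t_from_r_squared(0.60, 8)
    print(round(t, 3), t > T_CRITICAL_01_DF8)
```

Because the resulting t statistic exceeds the critical value, the reported P < 0.01 is consistent with the reported r² and degrees of freedom for the 10 cases.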

The absence of strong cross-resistance to 
IPD072Aa caused by resistance to Cry34/35Ab 
or Cry3 toxins is not surprising because Tips 
are strikingly different from Bt toxins. In- 
deed, Schellenberger et al. report that none of 
the Tips examined had any motifs, domains, 
or signatures known from other proteins. It 
will be important to determine how these 
intriguingly distinct proteins kill rootworms.

The billion-dollar question remains: Will 
Tips be a valuable alternative to Bt toxins for 
deployment in transgenic plants? Factors that 
might impede application of bacterial insec- 
ticidal proteins other than Bt toxins include 
their large size, lack of oral toxicity to pests, 
and broad toxicity to nontarget organisms. 
Tips are small and orally toxic to two species 
of corn rootworm (9). Bioassays with five cat- 
erpillar pests and one piercing-sucking pest 
indicate that they are not broadly toxic to insects (9). To better
understand their toxicity spectrum, testing of more species is needed,
including vertebrates and beneficial beetles such as the “ladybirds”
famed for their role in biological pest control.

Rootworm larvae can devour corn roots.

If Tips or other new control tactics such as 
RNA interference are widely adopted, root- 
worms will surely evolve resistance, just as 
they have to Bt corn and the time-honored 
tactic of crop rotation (10-15). In light of the 
remarkable adaptability of this and other in- 
sect pests, the increased efforts to find new 
natural insecticides spurred by the discovery 
of Tips could pay great dividends. 
CHEMICAL BIOLOGY


A radical approach to 
posttranslational mutagenesis 


Introducing diverse protein side chains via carbon-carbon 


bond-forming reactions 


By Raphael Hofmann and Jeffrey W. Bode 


The structure and function of proteins are extensively modulated and
expanded by posttranslational modifications (PTMs) on many of the
canonical amino acids. As a means of unraveling the role and interplay
of PTMs, methods to produce proteins 
with site-specific modifications have at- 
tracted considerable attention. In addition 
to PTMs, incorporation of unnatural side 
chains is of interest for protein engineer- 
ing, and systematic studies using analogs 
of amino acids offer insights into the mo- 
lecular mechanisms by which proteins and 
enzymes work. On pages 597 and 623 of 
this issue, Wright et al. (1) and Yang et al. 
(2) report potentially general strategies to 
chemically introduce a wide variety of natu- 
ral, unnatural, posttranslationally modified, 
and labeled side chains via an unprece- 
dented carbon-carbon bond-forming reac- 
tion on intact proteins. This approach will 
be of particular interest to chemical biolo- 
gists aiming to introduce authentic protein 
PTMs, as well as to protein chemists inter- 
ested in introducing unnatural side chains 
of their choice. 

Genetic code expansion approaches such 
as amber-codon suppression (3) have lifted 
the narrow constraints imposed by the ge- 
netic code and routinely enable the recom- 
binant production of proteins containing 
certain PTMs or unnatural amino acids. 
However, recombinant incorporation of 
unnatural amino acids relies on the avail- 
ability of, and efficient processing by, an 
orthogonal transfer RNA (tRNA) and tRNA 
synthetase pair. Protein synthesis (4, 5) or 
semisynthesis (6) allows for full flexibility 
with regard to unnatural residues but can 
pose a challenging task for large protein 
targets. Alternatively, site-specific chemical modifications of
expressed proteins at cysteine (Cys) or dehydroalanine (Dha) residues
(which serve as potent nucleophiles and electrophiles, respectively)
represent versatile and relatively simple strategies and have enabled
the generation of bioconjugates or PTM mimics (7, 8). However, such
“chemical mutagenesis” strategies have so far resulted in products
containing an unnatural thioether bond in the side chain.

Laboratory of Organic Chemistry, Department of Chemistry
and Applied Biosciences, ETH Zurich, Zurich 8093,
Switzerland. Email: bode@org.chem.ethz.ch

Published by AAAS

Wright et al. and Yang et al. now describe 
a three-step chemical mutagenesis strategy 
(see the figure). First, the site of modifica- 
tion in the expressed protein undergoes 
“genetic tagging.” This tagged residue is 
converted to the common Dha reactive pre- 
cursor. The Dha residue is allowed to react 
with alkyl carbon free radicals, generated 
from alkyl halides, to introduce the desired 
side chain via a carbon-carbon single bond. 
In this regard, the authors have leveraged 
the analogous alkyl halides like those used 
for Cys alkylation strategies but obtain 
products containing an all-carbon back- 
bone instead of the thioether linkage. 

For tagging and activation of the modifi- 
cation site, Wright et al. rely on Cys muta- 
tions and their posttranslational conversion 
to Dha using bis-alkylation with 2,5-dibro- 
mohexane diacetamide, followed by elimi- 
nation in mildly basic buffer (pH 8.0) (8). 
Yang et al. use their previously established 
system for recombinant incorporation of 
O-phosphoserine (Sep) (9) and a base-pro- 
moted elimination reaction (40 mM barium 
hydroxide) to convert Sep into Dha. The existence of several other
methods to obtain Dha-containing proteins (10-12) increases the chance
of finding a procedure compatible with a given protein target.

Wright et al. report, in detailed model reactions with small molecules,
peptides, and proteins, the successful and chemoselective zinc
(Zn⁰)-mediated radical reaction between Dha and alkyl halides in aqueous
buffer. The occurrence of undesired side reactions (including
substitution of the Cα capto-dative radical with a second equivalent of
the alkyl radical, as well as oxidative protein cleavage with molecular
triplet oxygen) prompted further optimization. The best results were
obtained by using sodium borohydride (NaBH₄) instead of Zn⁰, along with
oxygen-free conditions (glovebox), which resulted in selective bond
formation in many common buffers (pH 4 to 8) and suppressed side
reactions. Yang et al. found that a radical reaction promoted by Zn⁰ and
copper(II) acetate in the presence of surfactants and metal ligands
(tetramethylethylenediamine) in aqueous acidic conditions (sodium
acetate, pH 4.5) gave the best conversion. Both procedures call for up
to 3000 equivalents of alkyl halide for good conversion, which in some
cases is limited by their solubility in water.

The reaction tolerates primary, secondary, tertiary, and cyclic
aliphatic halides, as well as alkyl halides bearing unprotected amines,
phosphonates, guanidines, amides, or hydroxyl groups. The choice of the
appropriate organic halide, typically readily available either
commercially or synthetically, allows for systematic variation of side
chains. The value of the system is demonstrated by the introduction of
several side chains previously unattainable with codon suppression
techniques. Yang et al. report the incorporation of mono-, di-, and
trimethylated lysine at position 79 of histone H3 and its assembly into
chromatin templates. This feat allowed the authors to study the effects
of the various methylation states on in vitro transcription.
Incorporation of asymmetric dimethylarginine at position 26 of histone
H3 and its assembly into mononucleosomes by Wright et al. allowed the
identification of partner proteins through affinity enrichment
proteomics in human cell extracts.

Introducing unnatural side chains
Cysteine or O-phosphoserine residues at a particular protein site are
chemically converted to dehydroalanine (Dha), which can react with
carbon-centered radicals generated from the corresponding halides R-X to
introduce new side chains.

Combined, the authors achieve site- 
specific modification of 10 structurally di- 
verse proteins with a multitude of different 
side chains. Wright et al. also demonstrate 
that their conditions are compatible with 
a disulfide-containing single-domain anti- 
body fragment. It remains to be seen what 
other complex protein targets are compat- 
ible with these reaction conditions. For ex- 
ample, many research groups will be curious 
about the modification of antibodies, which 
contain easily reducible interchain disulfide 
bonds (13). The generation of diastereomeric 
products at the α-center of the modified residues
might limit its use for the production of
pharmaceutical bioconjugates. For the mo- 
ment, the method is restricted to a single site 
of modification within a protein. Regardless, 
this novel carbon-carbon bond-forming reac- 
tion could be used readily in chemical biol- 
ogy, given its versatility with regard to both 
protein substrates and side chains while re- 
taining the flexibility offered by other chemi- 
cal mutagenesis approaches. By demonstrat- 
ing the chemoselectivity of protein radical 
reactions, this work may further provide 
fruitful ground for the development of other 
radical-based protein modifications. 
WATER


Water strategies for 
the next administration 


Water policy offers opportunity for nonpartisan agreement 


By Peter H. Gleick 


Issues around fresh water are not particularly high on the U.S.
political agenda. They should be. Water problems directly threaten food
production, fisheries, energy generation, foreign policy, public
health, and international security. Access
to safe, sufficient, and affordable water is vi- 
tal to well-being and to the economy. Yet U.S. 
water systems, once the envy of the world, are 
falling into disrepair, and new threats loom. 
Drinking water disasters in Flint, Michigan, 
droughts and floods increasingly attributable to
anthropogenic climate change (1),
and growing violence worldwide over water 
offer a glimpse of what we face unless new 
efforts are made to address fail- 
ing infrastructure, worsening cli- 
mate conditions, and ineffective 
policies and regulations (2). Yet, 
if there is any issue that offers 
the opportunity for nonpartisan 
agreement, it is to create and im- 
plement a 21st-century national 
water policy. In that vein, I detail 
national and international water challenges 
and recommendations for the next U.S. presi- 
dent, administration, and Congress. 

Federal agencies, authorities, and poli- 
cies are often inconsistent, overlapping, and 
inefficient. Addressing water challenges 
requires consistent, effective, and efficient 
management and institutions. Yet ~30 different
federal agencies or departments have
overlapping and conflicting responsibilities 
for fresh water. For example, the Bureau of 
Reclamation (BoR), Army Corps of Engi- 
neers, and agencies like the Tennessee Valley 
Authority each build and manage dams. The 
Environmental Protection Agency oversees 
tap water quality, but the Food and Drug Ad- 
ministration oversees bottled water quality. 
The National Park Service, BoR, Forest Ser- 
vice, and others manage water resources on 
lands under their jurisdiction, often within 
the same watershed. A mix of federal and
state agencies and commissions manage in- 
ternational agreements over the shared wa- 
ters with Canada and Mexico. 

Production of food by U.S. farmers is at 
risk because federal water, energy, and agri- 
cultural policies often have conflicting and 
contradictory priorities and objectives (3). 
National policies designed to boost biofu- 
els production had unanticipated impacts 
on food production and regional water de- 
mands. For example, an average of 780 liters 
of water are required to produce a liter of eth- 
anol from irrigated corn, much of this from 
overdrafted aquifers in the Great Plains (4). 

The next U.S. president should create a bi-
partisan water commission to evaluate and
recommend changes to national water poli- 
cies. The commission would provide guid- 
ance to reorganize and streamline the diverse 
and uncoordinated federal water responsibil- 
ities and laws, including better coordination 
among energy, water, and food policies. We 
have had no such guidance since the final 
report of the last U.S. National Water Com- 
mission in 1973, which first called for—and 
helped drive acceptance of—environmental 
water policies, improved water-quality regu- 
lation, and better economic tools for utilities 
(5). Such commissions offer the opportunity 
to generate nonpartisan recommendations 
that can overcome political barriers to action. 

Basic water science and data collection 
remain undone. Vital water data are not col- 
lected or analyzed, and fundamental hydro- 
logic science remains incomplete (6). There is 
massive groundwater overdraft in California 
and the Great Plains-Ogallala aquifer but lit- 
tle accurate information about withdrawals 
or recharge rates. The U.S. Geological Survey
collects and publishes water-use data only
every 5 years (7), and data are not collected 
in a comprehensive or consistent manner. 
Links between clean and adequate water and 
healthy aquatic ecosystems are strong, but 
little information is available on sustainable 
watersheds and freshwater management. 

A national program to expand collection, 
management, and release of water supply 
and use data is key to developing sustainable 
policies and improving water sciences. This 
includes federal support for remote sensing 
platforms, such as replacing the SMAP (Soil 
Moisture Active Passive) satellite sensors and 
fully funding the National Oceanic and Atmo-
spheric Administration’s (NOAA’s) Joint Polar
Satellite System. Funding and expanding the 
new National Water Center, coordinated by 
NOAA, is a step in the right direction. 

Critical water infrastructure is often ob- 
solete and decaying. The United States pio- 
neered and built water treatment and delivery 
systems that provide nearly all Americans 
with safe water and sanitation and eliminated 
cholera, dysentery, and other water-related 
diseases prevalent in other parts of the world. 
But hundreds of thousands—if not millions— 
of Americans still lack access to safe water. 
Recent failures—such as in Flint, Michigan, 
where bad technical, financial, and manage- 
ment decisions led to high levels of lead in 
the water—highlight underinvestment in sys-
tem maintenance and replace-
ment. Weak or unenforced regu-
lations leave rivers, streams, and
lakes inadequately protected
from contamination, especially
from nonpoint sources of
pollution from agriculture and
urban development. Public and
private water agencies are not 
adequately monitoring and enforcing exist- 
ing laws and regulations. 

The next president and Congress must 
work together to modernize water-quality 
laws—in particular the Clean Water Act and 
the Safe Drinking Water Act (SDWA)—and 
give federal agencies resources to oversee 
and enforce these laws. Challenges include 
improving our ability to understand and 
mitigate uncontrolled sources of pollution in 
streams, rivers, and lakes; adding regulations 
to address long-ignored risks to groundwater; 
and accelerating rules for new contaminants 
in drinking water. Hundreds of unregulated 
chemicals and microbes may pose health 
risks (8) but no new contaminant has been 
added under the SDWA for decades. Other 
priorities should be the complete elimination 
of lead fixtures in cities, the testing of water 
in every school and remediation of any con-
tamination problems, and investment in new
water treatment and reuse technologies. 

In regions where water availability is a
growing problem, especially the western
United States, urban and farm water use can 
be made far more efficient with technologies 
such as precision irrigation, soil-moisture 
monitoring, and modern appliances and by 
using policy tools such as the Environmental 
Quality Incentives Program in the Farm Bill, 
trade laws, national efficiency standards, and 
tax-code revisions that promote water-effi- 
ciency investments for growers, industry, and 
communities. State revolving-loan programs, 
tax incentives, and direct support, especially 
for economically disadvantaged communi- 
ties, can expand funds available for upgrad- 
ing infrastructure. A 2013 report concluded 
that these strategies could produce hundreds 
of thousands of new jobs in urban and ag- 
ricultural conservation and efficiency, storm- 
water management, alternative water sup- 
plies, and ecological restoration (9). 

Links between water conflicts and national 
security are clear and growing. There is a 
long history of political insecurity and insta- 
bility in regions where access to fresh water 
is a problem (see the figure). Recent experi- 
ence in the Middle East shows that water 
problems contribute to food shortages, cre- 
ate environmental refugees, weaken govern-
ments, and worsen civil conflict (10). In Syria,
severe drought contributed to economic and 
political destabilization, and attacks on water 
systems have led to a worsening humanitar- 
ian disaster and new tensions between the 
United States and Russia. There are ongoing 
examples of violence related to water scar- 
city and control in eastern Africa and central 
and southern Asia. An updated list of such 
conflicts can be found at the Water Conflict 
Chronology database (http://worldwater.org/ 
water-conflict/). In 2012, the National Intel- 
ligence Council released an assessment of 
national security threats to the United States 
from global water challenges (11). Similar
assessments show how climate change may 
contribute to state collapse and threaten 
peace and security (12). To understand and 
reduce these risks, the intelligence commu- 
nity must monitor water-related threats to 
U.S. security and interests. The Department
of State should expand diplomatic efforts 
to prevent water-related conflicts world- 
wide, and the United States should ratify the 
United Nations Convention on the Law of the 
Non-Navigational Uses of International Wa- 
tercourses—the major international agree- 
ment that provides guidelines for peacefully 
managing shared watersheds (13). 

Many people still lack basic safe water 
and sanitation. More than 2 billion people— 
nearly a third of the global population—lack 
reliable, affordable access to basic water and 
wastewater services. Water-related diseases 
are prevalent in many developing countries, 
leading to nearly 250 million illnesses and
millions of preventable deaths a year, mostly
among children (14). Even in the United
States, many communities continue to suf- 
fer from contaminated drinking water or 
lack the financial resources to install modern 
treatment and distribution systems. 

As part of the international effort to reach 
the new Sustainable Development Goal of 
achieving 100% coverage of safe water and 
sanitation by 2030, Congress should boost 
the modest resources currently available for 
domestic and international programs to meet 
basic human needs for water and to monitor 
water-related diseases. The next U.S. presi- 
dent will be responsible for developing and 
issuing a Global Water Strategy in fall 2017, 
as required by the 2014 Paul Simon Water 
for the World Act (15). This law, passed with
bipartisan support in Congress, aims to redi- 
rect and expand U.S. foreign aid to increase 
access to safe water, sanitation, and hygiene 
in high-priority countries; work to improve 
watershed management in such countries; 
and help reduce water-related conflicts. 

Climate change impacts on water resources 
and systems are worsening. The most recent 
national scientific assessment of climate risks 
for the United States identifies a wide range 
of growing risks to water resources (16). Ris-
ing temperatures are increasing demands for
water and rapidly melting snow and ice. Ris-
ing sea levels are threatening coastal aquifers
and wetlands. Some floods and droughts,
already the nation’s most destructive natural
disasters, are now more extreme because of
human-caused climate change (17).

We need to integrate climate change into 
water management and planning to help 
adapt to unavoidable and increasingly harsh 
impacts of climate change. All federal agen- 
cies that manage land and water must in- 
tegrate scientific findings around climate 
impacts and adaptation into long-term plans 
and current operations. The next administra- 
tion will have to implement the provisions 
of the Paris Agreement that call for reducing 
emissions of greenhouse gases and work to 
prepare the country for the impacts of cli- 
mate change that can no longer be avoided. 

We have neglected the nation’s fresh wa- 
ter far too long. The next administration 
and Congress have the opportunity and re- 
sponsibility to ensure that federal agencies, 
money, and regulations work to protect our 
waters, citizens, communities, and national 
interests.

One brain, many genomes


Single-cell genomic techniques shed light on somatic 
mutations in brain development and neurologic disorders 


By Gilad D. Evrony 


We each begin life as a single cell
harboring a single genome,
which—over the course of devel-
opment—gives rise to the trillions
of cells that make up the body.
From skin cells to heart cells to
neurons of the brain, each bears a copy of 
the original cell’s genome. But as anyone 
who has used a copy machine 
or played the childhood game of 
“telephone” knows, copies are 
never perfect. Every cell in an 
individual actually has a unique 
genome, an imperfect copy of its 
cellular ancestor differentiated 
by inevitable somatic mutations 
arising from errors in DNA repli- 
cation and other mutagenic forces (1). So- 
matic mutation is the fundamental process 
leading to all genetic diseases, including 
cancer; every inherited genetic disease also 
has its origins in such mutation events that 
occurred in an ancestor’s germline cells. 
Yet how many and what kinds of somatic
mutations accumulate in our cells as we
develop and age has long been unknown 
and a blind spot in our understanding of 
the origins of genetic disease. 

While in the laboratory of Christopher 
Walsh at Boston Children’s Hospital and 
Harvard Medical School, I became in- 
trigued by reports of neurologic diseases 
caused by somatic mutations, including 
rare cases of epilepsy, neurodegeneration, 
intellectual disability, brain mal- 
formations, and autism spectrum 
disorder (2-8). There were also 
long-standing hypotheses in the 
field that somatic genetic diver- 
sity may be prevalent in the hu- 
man brain (9). Still, the common 
view was that the brain operates 
from a unitary genome. 

I wondered: How much of a genomic
patchwork is our brain? What kinds and
how many somatic mutations are present, 
and do they affect brain function? Could 
somatic mutations underlie some of the 
neuropsychiatric diseases whose causes re- 
main unknown? 

Answering these questions, we recog- 
nized, would require development of a 
technology to sequence the vanishingly 
small amount of DNA (6 picograms) pres-
ent in single brain cells. Any individual
somatic mutation may be present in only 
a very small fraction of cells or even just 
one cell, making it undetectable by stan- 
dard DNA sequencing, which mixes to- 
gether DNA from thousands or millions of 
cells. Together with colleagues, I developed 
methods to sequence the genomes of single 
brain cells, allowing detection of even the 
rarest somatic mutations. Along with novel 
bioinformatic approaches developed with 
Eunjung Alice Lee in the lab of Peter Park, 
this provided the first systematic, genome- 
wide measurements of somatic mutation 
in the brain (10, 11). Together with single-
cell studies of cancer and sperm (12-14),
this heralded the emergence of the field of 
single-cell genome sequencing, spurred by 
the confluence of whole-genome amplifica- 
tion technologies (15) and decreasing DNA- 
sequencing costs. 
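
The detection problem described above can be made concrete with a little arithmetic. The sketch below is only illustrative and is not the method used in these studies: it assumes a heterozygous mutation in diploid cells, so a mutation carried by a fraction f of cells appears in roughly f/2 of the reads from bulk DNA. The function names, the 1% per-base error rate, and the three-read threshold are all hypothetical values chosen for illustration.

```python
def expected_variant_reads(cell_fraction, depth, ploidy=2):
    """Expected number of reads carrying a heterozygous somatic mutation.

    cell_fraction: fraction of sampled cells that carry the mutation (0 to 1).
    depth: number of reads covering the mutated position.
    ploidy: copies of the locus per cell (2 assumes an ordinary diploid site).
    """
    if not 0.0 <= cell_fraction <= 1.0:
        raise ValueError("cell_fraction must be between 0 and 1")
    allele_fraction = cell_fraction / ploidy  # one mutant copy per carrier cell
    return allele_fraction * depth


def detectable_in_bulk(cell_fraction, depth, error_rate=0.01, min_reads=3):
    """Rough rule of thumb (an assumption, not a published criterion):
    the mutation is callable only if the expected supporting reads reach
    min_reads and exceed the reads expected from sequencing error alone."""
    expected = expected_variant_reads(cell_fraction, depth)
    expected_errors = error_rate * depth
    return expected >= min_reads and expected > expected_errors


if __name__ == "__main__":
    # A mutation in 20% of cells versus one in 0.01% of cells, at 100x depth.
    for fraction in (0.2, 0.0001):
        reads = expected_variant_reads(fraction, depth=100)
        print(fraction, reads, detectable_in_bulk(fraction, depth=100))
```

Under these assumptions, a mutation present in one cell out of ten thousand contributes far less than one read at typical bulk depths, whereas sequencing that single cell on its own would show the mutation in about half of the reads at that position.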

Our single-neuron genomics studies 
have identified remarkably diverse somatic 
mutations that reveal a wide gamut of mu- 
tation processes impacting the brain, from 
small point mutations and microsatellite 
polymorphisms to larger retrotransposon 
insertions, copy-number variants, and 
aneuploidy (10, 11, 16, 17) (see the figure).
Notably, we are finding that each type 
of mutation occurs at distinct rates and 
patterns (10, 11, 16-18). These and single-
neuron studies by others provide a proof of
principle for the systematic quantification 
of somatic mutations in any human tissue 
(19, 20). 

During my doctoral work, I was part of a 
team led by Ann Poduri that identified the 
first brain-specific somatic mutations caus-
ing neurologic disease (21). The disease in
question was hemimegalencephaly, a rare 
congenital brain malformation in which 
one brain hemisphere is severely dysplastic
and grows too large, leading to intractable
epilepsy. Using single-cell sequencing, we 
found that ~20 to 30% of both glial and 
neuronal cells carry the disease-causing 
mutations, allowing us to pinpoint neuro- 
glial progenitors of the cortex as the cell 
type where the mutations occurred (8, 10, 
16). This finding provides insight into the 
source of hemimegalencephaly and re-
lated focal neurologic diseases and may
potentially be relevant to other epilepsies 
of unknown origin. Whereas hemimegalen- 
cephaly is visible by imaging, our findings 
suggest that radiographically invisible so-
matic mutations—for example, in ion chan-
nels or synaptic proteins—may be an occult 
cause of some neurologic diseases. As a re- 
sult of these studies, numerous groups have 
begun systematic investigations of somatic
mutations in neuropsychiatric disease.

A key tool in developmental biology is a 
technique known as lineage tracing, in which 
the fates of all the offspring of a particular cell
(or group of cells) are tracked as the body
forms. In animal models, this is achieved 
by tagging cells with fluorescent proteins or 
other invasive markers that cannot be used 
in humans. Somatic mutations, however, oc- 
cur naturally and, as we were able to dem- 
onstrate, can serve as noninvasive lineage 
markers in humans (1, 11, 17).

Using somatic mutations identified by 
our single-neuron sequencing, we have 
been able to trace cell lineages in the hu- 
man brain and reconstruct their prolifera-
tion and migration during development (11,
17). Clear spatial patterns were revealed,
including cell lineages distributed over an
entire hemisphere at surprisingly low mo-
saicism, indicating spatial mixing among
early brain progenitors, as well as somatic 
mutations marking focally distributed lin-
eages present in only one small (<1 cm²)
area of the frontal cortex (11, 17). The lat-
ter pattern would seem to indicate that 
every brain is a mosaic patchwork of focal 
somatic mutations. It is therefore possible 
that rare, as-yet unrecognized brain dis- 
orders may exist in which a focal somatic 
mutation affects only one small region re- 
sponsible for a particular cognitive func-
tion, while sparing the rest of the brain.

Our studies have generated a number of
captivating new questions. Might the brain 
be particularly susceptible to harmful so- 
matic mutations because of the unique 
interconnectedness of its cells? Would neu- 
rogenetic diseases—for example, genetic 
cases of autism—manifest differently if the 
inciting mutation were present in only one 
hemisphere, one lobe, or just one gyrus of 
the brain? Single-neuron sequencing may 
also be useful for measuring the extent to 
which neurons, whose genomes must func- 
tion for decades, accumulate mutations 
with age and determining whether these 
mutations eventually impair function. We 
believe that single-cell genomics combined 
with single-cell epigenomics, transcrip- 
tomics, and proteomics will ultimately 
revolutionize our understanding of brain
development and function.

Parsing reward from aversion


Investigating the neural circuits encoding emotional valence 


By Anna Beyeler 


Starting from the moment we hear our
alarms in the morning, our emotions 
guide the thousands of decisions we 
make every day. More specifically, it 
is the valence of our emotions that 
determines our subsequent behavior. 
Valence is a concept that was originally de- 
fined in psychology and corresponds to the 
value we assign to the perceptions of our ex- 
ternal and internal environments 
(1). Valence varies from negative, 
when we are afraid or anxious, to 
positive, when we are happy or 
peaceful. In the case of the morning 
alarm, if your emotional state has 
a positive valence you might jump 
out of bed, eager to engage with 
whatever is motivating you. Con- 
versely, if your emotional state has a negative 
valence, you might choose to stay in bed to 
avoid the causes of your negative emotions.
Beyond allowing or preventing your timely
arrival to work, the neural circuits support-
ing valence encoding are critical for your sur- 
vival. Imagine the consequences if you were 
to assign a positive valence to, and therefore 
seek, dangerous situations, for example. In 
fact, misassignment of valence is not rare, 
and dysfunctions of the circuits encoding 
valence are thought to underlie many psy- 
chiatric illnesses including anxiety, depres- 
sion, addiction, and compulsive disorders (2, 
3). From a fundamental and clinical point of 
view, understanding how the brain 
attributes valence to contexts and 
salient elements of our environ- 
ment is one of the main challenges 
of modern neuroscience. 

It has been known for decades 
that the basolateral amygdala 
(BLA) is necessary for associative 
learning of both positive and nega- 
tive valence (4, 5). However, understanding 
how a single brain region can be respon- 
sible for encoding such a diverse range of 
information has been elusive. 

One hypothesis to explain this paradox 
is that specific subpopulations of the BLA 
encode opposite valence. As a postdoctoral 
fellow in the laboratory of Professor Kay
Tye at the Massachusetts Institute of Tech- 
nology (MIT), I have dedicated the past 4 
years to testing this hypothesis, combining 
cutting-edge approaches in functional neu- 
roanatomy, electrophysiology, and optoge- 
netics. During this time, my collaborators 
and I assessed different subpopulations of 
the BLA that relay integrated information 
to distinct regions. Over multiple studies, I 
identified circuit, synaptic, and molecular 
mechanisms by which three distinct neural 
populations in the BLA encode innate and 
learned behaviors (6-8). 

Consistent with recent studies in other 
brain regions (9), we found that different 
BLA neuronal populations (as defined by 
their projection targets) differentially en- 
code valence. To do so, we used optogenetic- 
mediated photo-tagging in combination 
with large-scale electrophysiological record-
ings during the retrieval of positive or nega-
tive associative memories in mouse models 
and examined the activity of BLA neurons 
projecting to the nucleus accumbens, the 
centromedial amygdala, or the ventral hip- 
pocampus (see the figure). 

Although heterogeneity of response types 
was known to exist in the entire BLA, we 
were surprised to find that when mice en- 
counter cues predicting a rewarding or 
aversive outcome (10, 11), heterogeneity of
response types was also noted in BLA sub- 
populations. However, two BLA popula- 
tions differentially encode valence during 
the retrieval of cues of opposite valence 
(7). Specifically, BLA neurons projecting to 
the nucleus accumbens were preferentially 
excited by cues predicting a reward, and 
those projecting to the central amygdala 
were preferentially excited by cues predict- 
ing an aversive outcome. The response-type 
heterogeneity we observed in BLA sub- 
populations could play a role in behavioral 
flexibility in cases in which the valence of 
environmental stimuli changes. 

Consistent with our recordings in vivo, 
the synaptic inputs of BLA neurons pro- 
jecting to the nucleus accumbens are po- 
tentiated after mice learn to associate 
a cue with a reward and depressed after 
mice learn to fear a cue associated with 
a foot shock (8). Conversely, the synaptic 
inputs of BLA neurons projecting to the 
centromedial amygdala were potentiated 
after fear learning and depressed after re- 
ward learning (8). Optogenetic activation 
of these projections induced approach and 
avoidance behavior, respectively (8), show- 
ing that these divergent pathways causally 
control valence encoding. 

Although we had previously found that 
optogenetically stimulating the BLA neurons
that project to the ventral hippocampus
drives negative valence in innate anxiety-
related behaviors (6), we determined that 
these neurons do not preferentially respond 
to cues predicting unpleasant outcomes (7). 
These results add a level of complexity to 
our understanding of valence circuits and 
support a model of distinct routing for 
the attribution of valence for innate and 
learned information. 

We have begun to unravel how BLA 
populations encode valence in physiologi- 
cal conditions (6-8). We can now start to 
investigate how valence circuits are dys- 
functional in pathologies such as anxiety 
and depression. Ultimately, we hope this re- 
search will lead to the development of novel 
strategies to restore the function of those 
circuits in human patients.

Building connections


Studies in the developing retina provide insights
into neural circuit assembly


By Arjun Krishnaswamy 


Much of a child’s first year is spent
asleep, punctuated by the occa-
sional feeding, fit, or flatulence.
The serenity is only skin deep,
as a flurry of action goes on be-
neath. The brain is building itself
at a frantic pace during this year, forming 
millions of synapses per second. Remark- 
ably, speed does not incur a cost 
in precision—children reach their 
first birthday with specific, largely 
mature patterns of connectivity 
already formed, ready to become 
terrible at 2. How the brain man- 
ages this remarkable feat is not 
altogether clear. 

We do know part of the story. 
It begins with the generation of neurons, 
which migrate to appropriate locations and 
extend axons long distances to target areas 
(1). The puzzle is what happens next, when 
axons and dendrites form their remarkably
specific patterns of connectivity. Forming
synapses is an astonishingly promiscuous 
process. Decades of study indicate that 
neurons readily synapse with each other 
if they share matched transmitters and re- 
ceptors, as well as generic organizing mol- 
ecules. Yet, axons in fact synapse with only 
small subsets of potential targets and often 
only on specific portions of a target—for ex- 
ample, a distal dendrite, a soma, or an axon 
hillock (1, 2).

How do these selective con- 
nections arise? One idea finds 
its roots in the classical work by 
Langley and Sperry: Selectivity 
between potential synaptic part- 
ners arises because they have a 
specific chemical affinity for one 
another (3-5). Endowing neurons 
with this affinity might be the role for the 
large families of recognition molecules that 
take up ~3% of the genome (1). There is 
ample evidence that such proteins regu- 
late cell-cell contact but little evidence that 
their combinatorial expression patterns are 
related to specific connectivity. Could such 
expression patterns encode some or all of 
the nervous system’s blueprint? 


Retinal circuit assembly

Model for selective synapse formation and circuit assembly in the retina. Progressively narrowing the options
available to developing neurons leads to specific connections among interneurons and retinal ganglion cells.
Modified from (8).


VISUALIZING THE WIRING OF THE RETINA
Conceptually, deciphering the code requires 
us to draw links among genes, circuitry, and 
function. Technically, this requires a list of 
the neural components to be wired, tools to 
mark and manipulate them, and methods 
to map circuitry. For much of the central 
nervous system, this is a tall order, but the 
retina is a notable exception. 

The retina is a sophisticated neural com- 
puter whose parallel circuits preprocess the 
visual scene to highlight salient features be- 
fore this information is transmitted to the 
brain (6). Each of the 30 or more retinal 
circuits begins with photoreceptors, which 
work as photon detectors. Each ends with 
a retinal ganglion cell (RGC), which acts as 
a feature detector that conveys visual infor- 
mation to the brain. There are ~30 types 
of RGCs, each uniquely attuned to features 
such as motion, color, contrast, and so on. 
In between are 70 or more types of inter- 
neurons. RGCs are endowed with their 
preference for different features via the syn- 
apses that they receive from specific subsets 
of interneurons, called amacrine cells (ACs) 
and bipolar cells (BCs) (6, 7). How this pre- 
cision is established and its contribution to 
retinal circuitry are unclear. 

Initially, methods to map retinal circuitry 
were lacking. An ever-growing compendium 
of mouse lines that let one mark and manipu- 
late many of the ~100 cell types in the retina, 
however, made this problem tractable. 

We designed and built a custom two- 
photon microscope-physiology rig to pro- 
vide spatiotemporally restricted optoge- 
netic excitation to hundreds of genetically 
defined interneurons in the mouse retina 
while recording intracellularly from defined 
RGC subtypes. Armed with this approach, 
we could map functional connectivity 
among the retinal types and study how 
these maps arise. 


CADHERINS DIRECT LAYER-SPECIFIC 
INNERVATION 

Using the technique described above, we 
first discovered a pair of related BCs that 
provide visual input to directionally se- 
lective RGCs (DSRGCs) (8). Anatomically, 
these BC axons target layers occupied by 
DSRGC dendrites, which suggests that BCs 
recognize these locations in the retinal neu- 
ropil. Using genetic screens, we discovered 
that a pair of cadherins were expressed 
by these BCs and later determined that 
these cadherins position the BC axons in 
layers containing DSRGC dendrites. This
organization is critical for DSRGC visual 
responses—perturbing it by deleting cad- 
herins disconnects BC input and effectively 
renders DSRGCs blind (8). Conceptually, 
choosing a layer mitigates the wiring chal-
lenge by limiting the total number of po-
tential partners that must be connected. 


A “SIDEKICK” ENHANCES PAIRING WITH
POTENTIAL SYNAPTIC PARTNERS 

Layering alone cannot explain the circuitry of 
the retina. Simply put, there are ~30 kinds 
of retinal circuits but only 5 to 10 layers in 
which to contain their wiring—how do neu- 
rons find each other within this tangled 
thicket? We reasoned that such selectivity 
should result in enriched connectivity be- 
tween a subset of colaminar neurons and 
set out to determine whether such patterns 
existed by functionally mapping more than
a dozen colaminar interneuron-RGC pairs.
We discovered that ACs defined by expres- 
sion of the vesicular glutamate transporter 3 
(VG3-ACs) connect well to one type of RGC, 
W3B RGCs, but poorly to other RGCs in the 
same layer (9). This selectivity is remarkable 
because VG3-W3B connections are interdigi- 
tated with those of several others. 

Such complexity poses a serious wiring 
challenge for these two neurons and suggests 
the presence of organizers within layers that 
operate at an incredibly fine scale. Parallel 
experiments confirmed this suspicion—VG3 
and W3B connect strongly because they both 
express a homophilic immunoglobulin su- 
perfamily (IgSF) adhesion molecule called 
Sidekick 2 (9). Connections of VG3 and W3B 
with equally proximate partners that are 
Sidekick-negative are substantially weaker. 
This indicates that recognition biases local 
connectivity in favor of particular pairings. 


A HIERARCHY OF WIRING CHOICES BUILDS 
RETINAL CIRCUITS 

This work leads to a speculative model 
for retinal circuit assembly that revolves 
around progressively narrowing the op- 
tions available to developing neurons (see 
the figure) (8, 9). This model is attractive 
because it unburdens the genome of hav- 
ing to provide unique recognition for every 
synapse. For example, the division of labor 
with cadherins for layers and IgSFs for tar- 
gets might allow the same IgSFs to be used 
in several layers at once. Reality might
be even simpler, since ~10 other kinds of
wiring choices have been documented, 
each enacted by largely distinct genetic 
programs (7). Combinations of such genetic 
programs might direct neurons through a 
hierarchical sequence of choices that result 
in particular connectivity patterns. 

Do neurons use all of these strategies? 
Or only a subset? How are they combined? 
Could the path of a given neuron influence 
the wiring of another? And perhaps most im- 
portant, how do activity and experience act 
on this initial scaffold to refine patterns of 
connectivity? Understanding these systems- 
level interactions among circuit patterning 
rules is critical for developmental neurobiol- 
ogy’s ultimate goal—to know how to build a 
brain. This goal, if realized, might offer ma- 
jor clues to brain function and new avenues 
of diagnosis and treatment of dysfunctions 
caused by mental illness.

Against all odds


A geologist revels in the unlikely reality of life on Earth 


By Jill S. Schneiderman 


When esteemed geologist Walter Al-
varez and his colleagues searched
the lowlands of Eastern Mexico
for ancient debris that had been
ejected from the Chicxulub impact
crater, they were following in the
footsteps of renowned early-20th-century 
paleontologist Charles Doolittle Walcott. In 
his own time, Walcott wandered the Cana- 
dian Rockies seeking Cambrian-aged fossils 
that could shed light on this crucial time pe- 
riod when multicellular life began to flour- 
ish. As recounted in his obituary, “One of the 
most striking of Walcott’s faunal discoveries 
came at the end of the field season of 1909, 
when Mrs. Walcott’s horse slid in going down 
the trail and turned up a slab that at once 
attracted her husband’s attention. Here was 
a great treasure—wholly strange Crustacea 
of Middle Cambrian time.... Snow was even 
then falling, and the solving of the riddle 
had to be left to another season.” Serendip- 
ity combined with persistence eventually led 
Walcott to one of the most important dis- 
coveries in the history of geology—the soft- 
bodied fauna of the Burgess Shale. 

In the case of the Alvarez group, two bro- 
ken jeeps on the last day of the field season 
of 1991 nearly prevented the researchers 
from reaching the strange sand bed that 
Alvarez had chanced to read about in a book 
about the geology of the region published 
in 1936. As they “bounced along the rough
road following the Arroyo el Mimbral, worrying
as the Sun got lower in the sky,” they
came upon a layer of glass spherules exposed
in a steep bluff along the dry river
bed. They suspected that the spherules were
droplets of impact melt—“the most wonderful
outcrop I have seen in five decades as a
geologist,” writes Alvarez in his new book, A
Most Improbable Journey: A Big History of
Our Planet and Ourselves.

A Most Improbable Journey
A Big History of Our Planet
and Ourselves
Walter Alvarez
Norton, 2016. 256 pp.

Alvarez is best known for helping to es- 
tablish that a meteor struck the Yucatan, 
causing the mass extinction of half the 
genera of animals on Earth. In A Most Im- 
probable Journey, he tells the story of the 
cosmos, Earth, life, and humanity using 
the interdisciplinary approach of Big His- 
tory, which combines traditional historical 
scholarship with scientific insights. Alva- 
rez aims to instill in his readers a sense of 
wonder that, despite enormous odds, there 
exists a planet (Earth) supremely suited for 
life. At the same time, he seeks to cultivate 
an expanded view of the nature of history, 
replete with contingency and consequent 
improbability, and to foster appreciation for 
the enormous stretches of time and space 
across which history has unfolded. 

In each section of A Most Improbable
Journey—“Cosmos,” “Earth,” “Life,” and
“Humanity”—Alvarez uses both broad
questions (have there been recognizable
patterns, regularities, cycles, and
contingencies in the history of continental
motions?) and “little Big History” (how did
the Spanish language come to dominate
the Iberian Peninsula and then much of
Latin America?) to help us comprehend
“our whole situation.”

Fossils found in the Burgess Shale have yielded
insights into our planet’s extraordinary biodiversity.

Throughout the book, Alvarez uses evoca- 
tive phrases and images: History is “violent 
and chancy,” rocks “remember [their] history,”
and mountains “are not wrecks—they
are sculptures.” Compelling images of rock 
and architectural engravings, relief and 
sketch maps, and historical photographs 
and drawings enrich the discussion but, 
unfortunately, are not referred to directly 
in the text. 

Alvarez invites his audience to read the 
book chapters in any order. Though I chose 
to read the book cover-to-cover, each chap- 
ter does indeed stand alone and therefore 
lends itself to this type of engagement. The 
book contains an appendix of resources that 
Alvarez annotates thoroughly, and it acts 
as an additional chapter that readers will 
likely enjoy perusing. 

The paleontologist and historian of science 
Stephen Jay Gould wrote in an essay about 
the discovery of the Burgess Shale fauna, “So 
much of science proceeds by telling stories.... 
Even the most distant and abstract sub- 
jects, like the formation of the universe or 
the principles of evolution, fall within the 
bounds of necessary narrative” (1). In A Most 
Improbable Journey, Alvarez harnesses such 
narrative, enabling readers to experience 
the power of Big History. 
Priceless


A cautionary tome probes the 
cost of black box algorithms 


By Barry Nalebuff 


In Virtual Competition, law professors
Ariel Ezrachi and Maurice Stucke be- 
gin with a scary premise: What if com- 
puters could collude? How would we 
even know it was happening, and what 
could we do about it? 

For starters, most pricing is already done 
by computer algorithms. A single airline 
flight may have more than 40 different 
fares. Uber’s decision to surge is based on 
dynamic estimates of capac- 
ity utilization. Amazon has 
more than 200 million prod- 
ucts for sale, too many for 
even an army of wizards be- 
hind the curtain to set prices. 

In the good old days, firms 
that wanted to collude would 
get together in bars or indus- 
try conferences and share pric- 
ing sheets. Today, they share 
pricing algorithms. In 2015, 
in the first Internet commerce 
antitrust case, David Topkins 
pleaded guilty to sharing 
pricing formulas for collect- 
ible posters sold on Amazon’s 
Marketplace. If Topkins and 
his competitors used the same 
pricing scheme, then no seller 
would undercut another. 

But that’s old wine in new bottles. Far 
more frightening are three new ways that 
computers can collude. There’s hub-and- 
spoke collusion, in which a firm sets prices 
for all its independent contractors. Uber, for 
example, sets the price for a trip; individual 
drivers can’t undercut each other. At least 
this coordination has the advantage of be- 
ing visible and easy to detect. 

More problematic is when a nefarious 
programmer writes code that encourages 
collusion, creating, for example, a system 
that immediately matches any price cut and 
follows any price hike. Such behavior trains 
rivals (humans or machines) to refrain from 
price cutting and to raise prices without 
fear of losing share. Although frightening, 
the collusive intent is hardwired into the 
code and thus discoverable. 
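
To see why such a matching rule discourages price cuts, consider a minimal sketch. It is an illustration under stated assumptions, not a model taken from the book: two sellers split a fixed market evenly when their prices are equal, the cheaper seller takes the whole market otherwise, and the matching program copies its rival's price one period later. The function names, the market size of 100 units, and the prices are all hypothetical.

```python
def matching_response(rival_last_price):
    """Hypothetical matching rule: copy whatever the rival charged last period.

    This both matches any price cut and follows any price hike.
    """
    return rival_last_price


def rival_profit(rival_prices, start_price, unit_cost=0.0, market_size=100):
    """Total profit of a rival that faces the matching program.

    Assumed toy demand: the cheaper seller serves the whole market,
    and equal prices split it evenly.
    """
    matcher_price = start_price
    total = 0.0
    for price in rival_prices:
        if price < matcher_price:
            share = 1.0   # rival undercuts and wins the whole market this period
        elif price == matcher_price:
            share = 0.5   # equal prices: market is split
        else:
            share = 0.0   # rival is undercut until the matcher follows the hike
        total += (price - unit_cost) * share * market_size
        # The matcher adjusts only after observing the rival's price.
        matcher_price = matching_response(price)
    return total


if __name__ == "__main__":
    periods = 10
    print("hold at 10:", rival_profit([10] * periods, start_price=10))  # 5000.0
    print("cut to 8:  ", rival_profit([8] * periods, start_price=10))   # 4400.0
    print("hike to 12:", rival_profit([12] * periods, start_price=10))  # 5400.0
```

Under these assumptions, a price cut wins one period of extra sales and then lowers profit in every period after, while a price hike costs one period of sales and then pays off once the matcher follows. The numbers are illustrative only; real demand rarely shifts entirely to the cheaper seller.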
Scariest of all is a scenario in which a
computer figures out both the advantages 
of collusion and how to make it happen. 
Here, the situation might resemble what 
happened with AlphaGo, the computer 
program developed to play the board game 
Go. The program’s success was mostly due 
to machine learning. The computer played 
countless games against itself and figured 
out what worked best. The end result is a 
black box: We don’t really know how the 
computer is making decisions, only that it 
works. Because successful collusion leads 
to higher profits, it would make sense that 
computers—left to their own devices— 
would figure this out. Antitrust authorities 
would have no way to punish this type of 
collusion under existing laws. 

That’s not all that worries our digital 
Cassandras. The use of big data may allow 
monopolists to be much more entrenched
and much more effective at raking in profits,
often at consumer expense. For example,
the cookies on my computer can tell
that I never go to price-comparison websites.
I fall into the “asleep-at-the-wheel”
category, which makes me a prime target
for price increases. Amazon may have
sworn off charging different consumers
different prices for the same book, but
other retailers, credit cards, and insurance
companies provide personalized pricing.

Self-learning algorithms are increasingly being used for retail pricing and logistics.

Will big data lead to “the end of com- 
petition as we know it”? Not yet. Ezrachi 
and Stucke recognize that computers can 
also provide solutions. I can give my proxy 
to a program that specializes in ferreting 
out the best deals, for example, leading to a 
battle of buyer bots versus seller bots. 

The authors often do a better job de- 
scribing the problems than offering solu- 
tions. One proposed solution to a potential 
Uber monopoly has a government regulator 
setting prices. The fact that San Francisco
is able to dynamically monitor and set 
prices for parking meters doesn’t give me 
faith that a regulator could do a better job 
dynamically setting prices for Uber—even 
if they had all of the company’s data. An 
additional concern is that such regulation 
could choke off innovations such as Uber 
pool or driverless cars. 

Another option the authors consider 
is for Uber to give up central pricing 
and have drivers bid on each trip. If that 
sounds complicated, it is. A 
ride-sharing app called Side- 
car tried it and shut down in 
2015. Even if it worked—per- 
haps with driver computer 
bots placing bids and pas- 
senger bots deciding which 
to accept—this could be a 
recipe for drivers to engage 
in local price discrimination. 
Today, the nearest driver 
usually takes the job at the 
Uber-set price. With driver
bidding, a nearby driver, 
knowing he or she is closest, 
could charge extra for the 
quicker pickup. 

Most important is the 
question of whether this is 
a problem that needs fixing. 
Today, Uber is in a price war 
with Lyft; in the future, it will face com- 
petition from Google and perhaps Apple. 
New technologies displace what once 
looked like entrenched monopolies. Re- 
member Blackberry? Blockbuster? 

We owe the authors our deep gratitude 
for anticipating and explaining the conse- 
quences of living in a world in which black 
boxes collude and leave no trails behind. 
They make it clear that in a world of big 
data and algorithmic pricing, consumers 
are outgunned and antitrust laws are out- 
dated, especially in the United States. They 
want their readers to agitate for new anti- 
trust laws. Sign me up. When the masses 
get mad enough, perhaps they’ll elect a 
new trust-busting Teddy Roosevelt for the 
digital era. In the meantime, we can hope 
the ideas discussed in Virtual Competition 
get on the political agenda. 
Children play on fishing boats at Lake Tanganyika, one of the East African Great Lakes threatened by oil exploitation.


Oil extraction imperils 
Africa’s Great Lakes 


AS THE WORLD’S demands for hydrocarbons
increase (1), remote areas previously
made inaccessible by technological limita- 
tions are now being prospected for oil and 
gas deposits. Virtually unnoticed by the 
public, such activities are ongoing in the 
East African Great Lakes region, threat- 
ening these ecosystems famed for their 
hyper-diverse biota, including the unique 
adaptive radiations of cichlid fishes (2). 
Countries in the region see exploitation of 
hydrocarbon reserves as a vital economic 
opportunity. In the Lake Albert region
of Uganda, for example, the government
foresees a $3.6 billion oil profit per year 
starting in 2018—a sum almost as high as 
the country’s current annual budget (3). 
However, oil extraction in the East African 
Great Lakes region poses grave risks to the 
environment and local communities. 

The thousands of oil spills reported in 
Nigeria (4) demonstrate that the extrac- 
tion and transport of oil are prone to 
accidents. This is especially bad news for 
the African Great Lakes because they are 
virtually closed ecosystems. For example, 
for Lake Tanganyika, which contains about 
one-fifth of the world’s surface freshwater 
(5), the flushing time is ~7000 years (5). 
This time frame implies that the recovery 
from an oil spill could take millennia. To 
make matters worse, the lakes’ location in 
a remote part of the world would impede a
quick and effective reaction to an oil spill. 
Appropriate infrastructures are currently 
unavailable at the lakes, and bringing in 
heavy equipment at the time of a spill 
would be cumbersome, logistically impos- 
sible, or prohibitively expensive. 

An oil spill would markedly affect the 
health, water supply, and food security of 
local communities (6). More than 10 million 
people depend on Lake Tanganyika alone 
for fisheries and water resources, and many 
more along the Congo River, into which 
the lake drains, are highly dependent on 
the lakes’ ecosystem (7). In addition to the 
toll on humans, an oil spill in these lakes 
would be a global catastrophe for biodi- 
versity. Combined, these lakes are home to 
thousands of species, almost all of them 
endemic (2). An accident might deal a 
final blow to these ecosystems, which have 
already been rendered fragile by anthro- 
pogenic stressors such as overfishing, 
deforestation, and global warming (8). 

Finally, large parts of the East African 
region still lack political stability and 
security (9). In addition to the possibility 
of accidents, competition for hydrocarbon 
resources could lead to sabotage, as has 
unfortunately been frequently observed in 
the Niger delta (4). 

We are concerned that the risks associ- 
ated with the intended exploitation of fossil 
hydrocarbons in the East African Great 
Lakes region are seriously underestimated. 
We urge the countries involved in these 
undertakings to engage with the scientific
and lake management communities to iden- 
tify strong mitigation and control measures 
that could be put in place before hydrocar- 
bon production begins. Local governments 
should foster alternative, sustainable plans 
to develop the region in accordance with the 
United Nations Sustainable Development 
Goals (10). To this end, the local population, 
regional stakeholders, governments, non- 
governmental organizations, and scientists 
must cooperate to develop economically and 
ecologically viable strategies for the region, 
as is currently being attempted for the 
Virunga National Park in the DR Congo (11).
Erik Verheyen,* on behalf of the 
Cichlid Science 2015 Meeting and 
concerned scientists 
Time for responsible
peatland agriculture 


THE 15TH INTERNATIONAL Peat Congress, 
held in Asia for the first time, brought 
together industry, policy-makers, and 
academia to discuss responsible peatland 
management. In Southeast Asia, peatland 
management is largely driven by the palm 
oil industry. After the Congress, misleading 
reports were published by leading Asian 
newspapers. They claimed that oil palm 
plantations on peatland can be viewed as 
sustainable [e.g., (1)] and supported the
continuation of business-as-usual peatland 
agriculture. This is contrary to the opinion 
of an overwhelming number of tropical peat- 
land scientists (2) and the vast majority of 
science published in the past two decades. 

Deep, carbon-rich peat deposits are 
maintained by a combination of steady 
organic matter inputs and high water tables, 
which inhibit microbial decomposition (3). 
Conversion of peat swamp forest (the natu- 
ral vegetation of Southeast Asian peatlands) 
to agricultural land requires removing 
vegetation and lowering groundwater 
tables. The combination of slash and burn 
techniques and drainage used to prepare 
peat for agriculture promotes smoldering 
fires and rapid peat oxidation. Peat fires are 
globally significant for their greenhouse gas 
emissions and threats to human health and 
regional economies (4). Peat oxidation leads 
to high CO2 emissions and land subsidence.
As the land surface falls toward river and sea 
levels, it will be subject to periodic and even- 
tually permanent flooding, limiting future 
agricultural production (5). Agricultural use 
of peatlands cannot, therefore, be considered 
sustainable from either environmental or 
socioeconomic perspectives. 

Industry and academia are working 
together to develop peatland agricultural 
systems (6) that minimize negative environmental
and commercial impacts. In the
interim, steps should be taken to improve 
hydrological management of peatlands 
under agriculture and to implement 
landscape-scale management planning. 
Denial of known issues slows progress 
toward responsible solutions, which are 
urgently needed to prevent avoidable 
losses of Southeast Asia’s peatlands, as 
well as global consequences. 
Protecting China’s
soil by law 


AFTER SEVERE PROBLEMS with air and 
water pollution, China is getting serious 
about its soil (1, 2). On 31 May, China’s State
Council released an action plan for soil pol- 
lution prevention and remediation, aiming 
to make 90% of polluted, arable land safe for 
human use by 2020 and increase this to 95% 
by 2030 (3). This ambitious action plan calls 
for the support of strong environmental laws 
to monitor, prevent, and remediate serious
levels of soil contamination. However,
national legislation protecting soil quality 
has lagged behind that of air and water for 
more than a decade, while Chinese law- 
makers debate the focus and purpose of a 
soil protection law. 

On 3 September, China released a draft 
of its first environmental tax law, which 
designates four taxable types of pollution: 
airborne and water pollutants, solid waste, 
and noise (4). Soil is conspicuously absent. 
A soil protection law could close the current
environmental legislation system’s
loopholes, make China’s new environmen- 
tal tax system more comprehensive, and 
protect China’s soil. 

To make a pragmatic soil protection 
law, the central government must clearly 
identify local government’s liability and 
responsibility for soil pollution, as ambigu- 
ous responsibility has been one of the major 
problems in soil management in the past. 
The law must stipulate the division of duties 
between government agencies, establish a 
surveying and monitoring system, and intro- 
duce funding mechanisms. Remediation of 
contaminated soil is extremely costly, and 
China needs to create a national soil fund 
by allocating an adequate percentage of 
its land revenues and environmental tax 
revenues. Some members of the soil pollu- 
tion plan panel initially suggested that 10% 
of land revenues be designated to the fund 
(5). Because China lacks comprehensive risk 
assessment systems for contaminated land 
management (6), the law should stipulate 
risk management and control approaches 
for contaminated sites. 

China’s soil pollution has become a critical 
issue that affects public health and creates 
social unrest and instability (1, 7, 8). China
should not repeat its past mistakes of focus- 
ing on economic growth at the expense of 
the environment. 
By Peter Stern and Leslie Roberts


It can hit as a piercing jolt, or sometimes as a dull ache that radiates
through the body. Pain can be exquisite, or debilitating and chronic, 
which can destroy one’s quality of life. 

Pain starts with a stimulus at the periphery of the body, which then 
travels through neurons to the brain. But the process is far more com- 
plex. A number of neuronal circuits are involved in pain transmission. 
Most of these circuits are plastic and can change when something goes 
awry. Once we understand these circuits, we can interfere and block 
them at the right level. 

However, pain is not only a matter of neurons. The tissue around 
them plays an important role, too. Other cells—such as skin, immune, or 
glia cells, to name just a few—participate in the pathogenesis and also the 
resolution of pain. Nor is there a simple one-way street from the periphery 
to the brain. Powerful cognitive processes shape the way that we perceive 
pain. This perception is determined by our expectations and the situation 
in which we find ourselves. Clinicians need this knowledge to develop tech- 
niques for personalized treatment of chronic pain and to prevent pain from 
spiraling out of control. 

Failed attempts to manage pain have contributed to the opioid epidemic in 
the United States. Understanding how opioids work, and how in some circum- 
stances they might actually amplify pain, should lead to alternatives. As many 
states legalize the medical use of marijuana, hints are emerging that cannabis
could be one, but research is complicated by strict federal regulations.

SPECIAL SECTION

NEWS
Pot and pain p. 566
Primed for pain p. 569

REVIEWS
Pain regulation by non-neuronal cells and inflammation p. 572
Neural circuits for pain: Recent advances and current views p. 578
Deconstructing the sensation of pain: The influence of cognitive processes on pain perception p. 584
Exploring pain pathophysiology in patients p. 588

RELATED ITEMS
> EDITORIAL P. 529
> PODCAST
> VIDEO

Pain is a subjective experience. People
distracted by a suspense-packed movie
experience less pain at the dentist's office.

Downloaded from http://science.sciencemag.org/ on November 10, 2016
Hints are emerging that cannabis could be
an alternative to opioid painkillers 


By Greg Miller 


In the mid-19th century, some European
doctors became fascinated with a
plant-derived drug recently imported 
from India. Cannabis had been used 
as medicine for millennia in Asia, and 
physicians were keen to try it with 
their patients. No less an authority 
than Sir John Russell Reynolds, the 
house physician to Queen Victoria 
and later president of the Royal College of 
Physicians in London, extolled the medical 
virtues of cannabis in The Lancet in 1890. 
“In almost all painful maladies I have found 
Indian hemp by far the most useful of 
drugs,” Reynolds wrote. 

Like other doctors of his day, Reynolds 
thought cannabis might help reduce the 
need for opium-based painkillers, with 
their potential for abuse and overdose. “The 
bane of many opiates and sedatives is this, 
that the relief of the moment, the hour, or 
the day, is purchased at the expense of tomorrow’s
misery,” he wrote. “In no one case
to which I have administered Indian hemp, 
have I witnessed any such results.” 

More than 125 years later, the misery 
caused by opioids is clearer than ever, and 
there are new hints that cannabis could be 
a viable alternative. Some clinical studies 
suggest that the plant may have medical 
value, especially for difficult-to-treat pain 
conditions. The liberalization of marijuana 
laws in the United States has also allowed 
researchers to compare overdoses from 
painkiller prescriptions and opioids in 
states that permit medical marijuana versus 
those that don’t. Yet following up on those 
hints isn’t easy. Clinical studies face addi- 
tional hurdles because the plant is listed on 
Schedule I, the U.S. Drug Enforcement Ad- 
ministration’s (DEA’s) list of the most dan- 
gerous drugs. 

Some researchers worry that rigorous 
research is being outpaced by informal ex- 
perimentation, as millions of people with 
access to medical marijuana treat themselves.
“It’s clear that the policy has gone
way out in front of the science in terms of
allowing access to products that haven’t 
been through the standard clinical trials 
process,” says Mark Ware, a pain specialist 
at McGill University in Montreal, Canada. 


NEARLY 2 MILLION AMERICANS were ad- 
dicted to or abusing prescription opioid 
drugs in 2014, according to the Centers for 
Disease Control and Prevention, and the 
Kaiser Family Foundation estimates that 
more than 21,000 died from overdoses. 
That same year, a study published in JAMA 
Internal Medicine hinted that medical 
marijuana could make a dent in that alarm- 
ing toll. The researchers, led by Marcus 
Bachhuber, then at the Philadelphia Veter- 
ans Affairs Medical Center in Pennsylvania, 
examined death certificates in all 50 states 
between 1999 and 2010. They found that the 
annual rate of deaths due to overdose on an 
opioid painkiller was nearly 25% lower in 
states that permitted medical marijuana. 
In 2010, that translated into 1729 fewer 
deaths in those states. The researchers also 
found that the effect grew stronger in the 
5 to 6 years after the states approved 
medical marijuana. 

More recently, David Bradford, a health 
economist at the University of Georgia in 
Athens, and his daughter Ashley, a master’s 
student there, sought to investigate whether 
marijuana was supplanting conventional 
drugs in states where it’s legal. Analyzing 
Medicare drug prescription data from 2010 
to 2013, they found a significant difference 
in the number of prescriptions for several 
conditions, including anxiety and nausea, 
in states with medical marijuana. But one 
condition stood out from the rest: “The ef- 
fect for pain was three to four times larger 
than all of the others,” David Bradford says. 
In medical marijuana states, each physician 
prescribed an average of 1826 fewer doses 
of conventional pain medication each year, 
they reported in the July issue of Health Affairs.
That translates into many millions of
doses per year in those states.

For researchers, marijuana is
tightly regulated, but some
patients can easily buy it at
dispensaries like this one.

The Bradfords haven’t yet analyzed how 
many of those doses were opioid drugs ver- 
sus other painkillers, but David Bradford 
suspects it’s a large chunk. “It’s suggestive 
evidence that medical marijuana might help 
divert people away from the path where 
they would start using [an opioid drug], 
and of course if they don’t start, they’re 
not on that path to misuse and abuse and 
potentially death.” 

In a follow-up study, the Bradfords ana- 
lyzed prescription data from Medicaid re- 
cipients, a younger population than the 
Medicare enrollees in their previous study. 
So far, the reduction in pain prescriptions 
appears to be even more dramatic in this 
group, David Bradford says. 

Additional evidence about whether can- 
nabis can reduce opioid use could come 
from Canada, which legalized medical 
marijuana in 2001 and might legalize recre- 
ational use as soon as next year. In Quebec, 
researchers established a patient registry 
in 2015 to collect demographic data on 
patients who use medical marijuana, the 
type and dose they take, and the conditions 
they're seeking treatment for, along with 
self-reports on benefits and adverse out- 
comes. McGill’s Ware, who is leading the effort,
says the registry is also collecting data
on opioid use. “We'll certainly be looking at 
whether patients who manage their pain 
with cannabis can reduce their opioid doses 
over time or even wean themselves off opioids
entirely,” he says.

Yet in the United States, where 25 states 
and Washington, D.C., have legalized medi- 
cal marijuana, there are no state-wide ef- 
forts to collect data on how patients are 
using cannabis or on whether they have 
been affected for good or ill, in part be- 
cause marijuana is still illegal at the federal 
level. That’s a huge missed opportunity, says 
Ryan Vandrey, a behavioral pharmacologist 
at Johns Hopkins University in Baltimore, 
Maryland. “It’s mind-boggling that we have 
millions of people in the U.S. using can- 
nabis for medicine and we not only don’t 
have the proper data to help them take it 
appropriately, we’re not doing a good job of 
collecting it.” 

Researchers have good reason to think 
marijuana might relieve pain. Tetra- 
hydrocannabinol, or THC, the plant’s main 
psychoactive ingredient, binds to a class 
of receptors on neurons that are involved 
in mediating pain, appetite, and mood, 
among other things. “It’s working directly 
on pain pathways in the brain, spinal cord, 
and periphery,” says Ethan Russo, a neurologist
and medical director of Phytecs, a Los
Angeles, California-based company develop- 
ing therapies based on compounds isolated 
from marijuana. Previously, Russo oversaw 
international clinical trials for Sativex, an 
oral spray made by GW Pharmaceuticals in 
Salisbury, U.K., that has been approved in 
27 countries for treating spasticity caused 
by multiple sclerosis and in Canada for cer- 
tain types of pain. Sativex combines THC 
with cannabidiol, another compound in 
marijuana that may counteract the anxi- 
ety and cognitive side effects associated 
with THC and that appears to have anti- 
inflammatory effects. 

But few marijuana-based therapies 
have gone through clinical trials. A meta- 
analysis published last year in the Journal 
of the American Medical Association found 
just 28 randomized clinical trials investi- 
gating cannabis for chronic pain. (Sativex 
accounted for nearly half of them.) The 
authors concluded there was “moderate 
quality evidence” to support its use. Part of 
the reason for the scarcity of cannabis tri- 
als is that whole plants and natural extracts 
aren't patentable, giving pharmaceutical 
companies little incentive to pursue them. 

Recently, however, some states that have 
legalized medical marijuana have begun to
fund clinical studies. California, which in 
1996 became the first U.S. state to legalize 
medical marijuana, led the way with its Cen- 
ter for Medicinal Cannabis Research, which 
has done several placebo-controlled studies 
on pain. Barth Wilsey, a pain management 
physician at the University of California in 
San Diego, led two of them. The first, pub- 
lished in 2008, found that smoking mari- 
juana reduced pain caused by nerve damage 
in 38 patients, with minimal side effects. The 
second, published in 2013, found that vapor- 
ized cannabis, even in low doses, relieved 
pain in a similar group of patients who 
hadn’t responded to traditional medications, 
including opioid analgesics. 

A trial just getting underway at the Uni- 
versity of Colorado (CU) Anschutz Medical 
Campus in Aurora will be the first to directly 
compare cannabis and opioid painkillers in 
patients with back and neck pain. “There’s 
definitely emerging evidence in the literature 
for [using cannabis to treat] neuropathic 
pain, but there’s hardly anything for chronic 
back and neck pain, which is one of the most 
common reasons people go see their doctor,” 
says neurobiologist Emily Lindley, who will 
run the CU study. 

Hers is one of nine medical marijuana research
grants funded so far by the state of
Colorado with a total of $9 million from tax 
collected on marijuana sales. The impetus for 
the study was a survey done a few years ago 
at CU Hospital’s Spine Center. Nearly one- 
fifth of the 184 patients with chronic back 
and neck pain who responded to the survey 
reported using marijuana to treat their pain. 
Of those, 86% reported that it “moderately” 
or “very much” relieved their pain, and 77% 
said marijuana provided as much or more 
relief than their opioid prescription painkillers.
“We expected to see some positive effects
regarding pain control but not quite to
that extent and not with that many patients,”
says Vikas Patel, chief of orthopedic spine
surgery at CU.

Now, Lindley’s study will enroll 50 patients
with back and neck pain, who will visit the
university three times and receive either vaporized
cannabis, the opioid drug oxycodone,
or a placebo. (In the case of cannabis, the
placebo is marijuana with the THC chemically
extracted; for oxycodone, the placebo is
a pill.) At each visit, the patients will be given
a battery of tests to assess their pain levels
and look for side effects like impairments
of memory, attention, and concentration.

BUT SUCH RESEARCH faces regulatory obstacles,
because DEA still classes marijuana as
a Schedule I drug: the most dangerous drugs
with no known medical benefits. It has taken
Lindley nearly 2 years from the time she received
her grant to start her study. Getting the
required Schedule I license from DEA took
about 6 months. Prior to that, the university
spent tens of thousands of dollars to install secure
narcotics cabinets to meet DEA’s requirements
and a new ventilation system to comply
with its own no smoking policy.

A rising toll
Statistics from the Kaiser Family Foundation show a growing epidemic of opioid abuse in the United States over the
last 15 years. In the cartogram below, the size of each state reflects the total number of prescription opioid overdose
deaths from 1999-2014. Darker colors indicate more overdose deaths relative to the state's population in 2014.

In August, DEA rejected two petitions 
to remove marijuana from Schedule I. The 
decision was made after a scientific re- 
view by the Food and Drug Administration 
(FDA) concluded that the evidence for the 
medical benefits of marijuana did not meet
its standards for new drug approval.
FDA noted that most cannabis studies
to date have been fairly small—with a few 
dozen participants, not hundreds—and 
they’ve followed patients for a few hours, 
not the 12 weeks or more that’s typical in 
the clinical trials pharmaceutical compa- 
nies conduct. Another complication is the 
variation in how cannabis is delivered. Pa- 
tients in many early studies smoked it, and 
people ingest varying amounts of THC per 
puff. Newer delivery systems, such as vapor- 
izers and edible products, add still more un- 
certainty about the doses patients actually 
receive. Then there’s the natural variation 
in the concentration of THC and other can- 
nabinoids in different strains of marijuana. 

Even scientists who are bullish on the po- 
tential for medical marijuana acknowledge 
that consistent dosing is an issue. Yet many 
researchers see the situation as a Catch-22: 
The Schedule I listing and other restric- 
tions on marijuana research hinder the 
type of studies that are needed to convince 
regulators to loosen those restrictions. 

Two bills introduced in Congress this 
year aim to lower some of these hurdles. 
The bills would limit the time that DEA 
spends reviewing proposed research stud- 
ies (just as FDA has 30 days to review drug 
studies). They would also restrict DEA’s 
role in making sure listed drugs are stored 
securely. Now, DEA also has to weigh in 
on changes of scientific protocol, and that 
can really slow things down, says Vandrey 
of Johns Hopkins, who is collaborating 
on a study there to compare the analgesic 
effects of cannabis and the opioid drug 
hydrocodone in healthy subjects. 

A third bill, introduced in July, aims to 
ease research with cannabidiol and other 
chemical components of marijuana. “The 
current interpretation [of the Controlled 
Substances Act] is that anything in the 
plant is Schedule I,” Vandrey says. Even 
though there is no evidence that cannabi- 
diol is prone to abuse, researchers inter- 
ested in studying it have to jump through 
the same hoops as if their study involved 
whole-leaf marijuana. “That, in my mind,
is just silly,” Vandrey says. Research on terpenes,
still another group of cannabis compounds
that may have analgesic effects,
faces the same hurdles.

“With changing attitudes and changing
policy, I’m hopeful that research can proceed
with fewer barriers,” McGill’s Ware
says. He and others hope they’ll soon be
able to firm up the case for marijuana as
an effective pain treatment. “I’d hate to
think we’re still asking the same questions
10 years from now,” Ware says.


Greg Miller is a science and technology 
journalist in Portland, Oregon.

Cold pressor tests estimate
a person’s pain threshold
based on how long they can
tolerate ice-cold water.

Do opioids make chronic users more sensitive to pain?


By Kelly Servick; Photography by Matthew Rakola 


Mark Hutchinson could read the
anguish on the participants’ 
faces in seconds. As a gradu- 
ate student at the University of 
Adelaide in Australia in the late 
1990s, he helped with studies 
in which people taking metha- 
done to treat opioid addiction 
tested their pain tolerance by 
dunking a forearm in ice water. Healthy con- 
trols typically managed to stand the cold for 
roughly a minute. Hutchinson himself, “the 
young, cocky, Aussie bloke chucking my arm 
in the water,” lasted more than 2 minutes.
But the methadone patients averaged only 
about 15 seconds. 
“These aren’t wimps. These people are injecting
all sorts of crazy crap into their arms.
... But they were finding this excruciating,” 
Hutchinson says. “It just fascinated me.” The 
participants were taking enormous doses of 
narcotics. How could they experience such 
exaggerated pain? 

The experiment was Hutchinson’s first 
encounter with a perplexing phenomenon 
called opioid-induced hyperalgesia (OIH). At
high doses, opioid painkillers actually seem 
to amplify pain by changing signaling in the 
central nervous system, making the body 
generally more sensitive to painful stimuli. 
“Just imagine if all the diabetic medications, 
instead of decreasing blood sugar, increased 
blood sugar,” says Jianren Mao, a physician
and pain researcher at Massachusetts Gen- 
eral Hospital in Boston who has studied 
hyperalgesia in rodents and people for more 
than 20 years. 

But how prevalent hyperalgesia is, and 
whether it plays a role in the U.S. epidemic 
of opioid abuse and overdose, is unclear. A 
lack of reliable testing methods and a se- 
ries of contradictory papers have created 
believers and skeptics. A few research- 
ers, like Mao, think hyperalgesia is an 
underappreciated puzzle piece in the opioid 
epidemic—a force that can pile on pain, drive 
up doses, and make it harder for chronic us- 
ers to come off their drugs. Some of those 
researchers are looking for ways to turn
down hyperalgesia, to help patients func- 
tion on lower doses of their oxycodone, for 
example, or make it easier to taper off it al- 
together. Others see OIH as an oddity in the 
literature—real, and a powerful clue to the 
workings of pain pathways, but unlikely to 
tighten the grip of opioids on most patients. 
Hutchinson thinks the majority of physi- 
cians are either unaware of hyperalgesia 
or unconvinced of its importance. “I think 
if you surveyed prescribers of opioids, 
they would be divided probably 60-40.” 


PARADOXICAL AS IT MAY SEEM, OIH makes 
evolutionary sense. “Nature didn’t come 
up with pain just to torture mankind,” says 
Martin Angst, an anesthesio- 
logist and clinical pharma- 
cologist at Stanford Univer- 
sity in Palo Alto, California. 
Pain causes us to recoil from 
a hot stove and to stay off 
an injured leg while it heals. 
And when it’s crucial that we 
temporarily ignore pain—say, 
when we run on that injured 
leg to evade a charging lion— 
the body has a way of numb- 
ing it, in part by releasing its 
own opioids. These natural 
molecules bind to receptors 
on neurons to block pain sig- 
nals and activate reward cen- 
ters in the brain. 

But doses of prescription 
opioids are orders of mag- 
nitude higher than our en- 
dogenous levels, Angst says. 
Confronted by these, “your 
biology fights back and says, 
‘I’m blindfolded to pain by all
these chemicals. I need to be
able to sense pain again.’”

Mao was among the first to delve into 
potential mechanisms of OIH in an animal 
model. In 1994, while at Virginia Common- 
wealth University in Richmond, he and his 
colleagues showed that after 8 days of spi- 
nal morphine injections, rats were quicker 
to pull their paws away from a gradually 
heated glass surface. The animals’ baseline 
pain threshold had changed, and the ef- 
fect was something more than tolerance, in 
which the body requires increasing doses of 
a drug to get the same effect. In this case, a 
higher dose could actually increase sensitiv- 
ity to pain. 

The researchers found they could reverse 
the hyperalgesic effect by blocking certain 
receptors on neurons in the animals’ spinal 
cord. These N-methyl-D-aspartate (NMDA) 
receptors pick up chemical signals—notably 
an excitatory molecule called glutamate— 
released by sensory neurons projecting
from the skin and organs, and transmit 
pain signals up to the brain. Researchers 
already knew that even without opioids, 
some people with chronic pain from nerve 
damage or fibromyalgia, for example, expe- 
rience hyperalgesia when normal pain sig- 
naling gets reinforced and amplified over 
time. It appeared that, at least in animals, 
opioids had a similar effect. 

By 2000, Mao was turning his attention 
to patients, and the population of opioid 
users was expanding. Doctors had begun 
to consider the drugs relatively safe options 
for managing chronic pain. With the release 
and aggressive marketing of the long-acting 
narcotic OxyContin in 1996, a class of drugs
that had largely been reserved for cancer
patients was becoming a go-to treatment for
conditions such as lower back pain.

Some studies of hyperalgesia rely on gradually heated probes applied to the skin.

As prescribing skyrocketed, so did over- 
doses. U.S. deaths from prescription opi- 
oids have roughly quadrupled in the last 
2 decades, reaching 21,000 in 2014. Mak- 
ing things worse, abundant prescription 
opioids have been diverted for recreational 
use, which has driven up rates of heroin 
addiction as users have sought cheaper 
or more accessible alternatives. Both pre- 
scription and illegal opioids kill when high 
doses slow breathing, especially when com- 
bined with alcohol or antianxiety drugs 
called benzodiazepines. 

“I’m not sure you could find an example
of physicians doing more harm to human
beings than we have achieved in our liberal
opiate prescribing,” says David Clark, an
anesthesiologist at Stanford.

Mao and others wondered whether hyper- 
algesia was another important opioid side 
effect. People might be seeking a higher 
dose as drug-induced pain compounded the 
original pain, he thought. If so, doctors who 
ignore hyperalgesia might bump up the 
dose when the right decision was to reduce 
it. And when a patient tried to taper off a 
drug, a temporarily lowered pain threshold 
might make it harder for them to manage 
without it. “If they’re hyperalgesic, they can 
just go back to the drug again to feel okay,” 
says Jose Moron-Concepcion, a neuro- 
scientist at the Washington University 
School of Medicine in St. Louis in Missouri. 

The evidence for hyperalgesia is clear- 
est in people taking extreme 
doses—for instance, in opioid 
abusers or terminal cancer 
patients managing severe 
pain. Surgical patients given 
large amounts of the opioid 
remifentanil have shown 
signs of hyperalgesia; they 
have larger areas of soreness 
around their wounds and 
seem predisposed to chronic 
pain following surgery. But 
what about patients who take 
lower doses of opioids daily 
over months or years to man- 
age chronic pain? As a pain 
specialist at a large teaching 
hospital, Mao frequently en- 
counters patients who can’t 
find relief from increasing 
opioid doses and who tell him 
that their pain has become 
worse—diffuse, nagging, and 
harder to pinpoint. 

But just how many people 
experience OIH, and at what 
opioid dose, is hard to say. The 
phenomenon can be very hard to distinguish 
from tolerance, when pain increases as the 
drug loses its effectiveness over time. (It’s 
also possible that a patient’s underlying con- 
dition has changed, or that the chronic pain 
itself has kicked their pain signaling into 
high gear.) 

Because diagnosing hyperalgesia can be a 
guessing game in the clinic, some research- 
ers have turned to the lab. They have tried 
to document changing pain thresholds with 
quantitative sensory tests, like the so-called 
cold pressor test Hutchinson witnessed in 
the methadone patients in Australia, or con- 
traptions that apply heat or pressure to the 
skin. But the studies have been small and the 
results inconsistent. “Nobody has actually 
shown that that particular stimulus in a human
being is a valid way to say, ‘Yes, this person
has become hyperalgesic,’” Angst says.


GRAPHIC: V. ALTOUNIAN/SCIENCE 


In 2006, for instance, a team that in- 
cluded Angst and Clark gave the cold pres- 
sor test to six people with chronic lower 
back pain before and after a monthlong 
course of morphine pills. After the drug 
treatment, the team found signs of hyper- 
algesia: On average, the subjects registered 
pain from the ice water about 2 seconds 
earlier, and removed their hands about 
8 seconds earlier, than they had before- 
hand. But those results didn’t hold up in 
a larger group of 139 patients random- 
ized to take opioids or placebo, nor did 
they appear in a different pain test that 
applied a gradually heated probe to the 
forearm. Then in 2013, a study with a 
different methodology seemed to con- 
firm the effect. A research team in Is- 
rael reported evidence of hyperalgesia in 
17 of 30 patients with radiating spinal nerve 
pain by asking them to rate the intensity of 
heat pain on a numerical scale before and 
after a 4-week course of hydromorphone. 

If you can’t reliably diagnose hyper- 
algesia, it’s hard to predict its long-term ef- 
fects, says Michael Hooten, an anesthesio- 
logist at the Mayo Medical School in 
Rochester, Minnesota. His group found 
evidence in 91 patients tapering off opi- 
oids that those whose doses were higher 
at the start, forcing them to make greater 
reductions over the 3-week program, had 
worse measures of heat pain hyperalgesia. 
But the team wasn’t able to track these 
patients long-term to ask the bigger ques- 
tions: How long until their pain thresholds 
bounced back to normal? Do hyperalgesic 
patients who manage to quit taking opioids 
ultimately see improvements in pain? Are 
hyperalgesic patients more or less prone to 
addiction or relapse? 

For some, this lack of evidence makes 
research into hyperalgesia look like a dead 
end. “When I go to work every day, I don’t 
think about opioid-induced hyperalgesia,” 
says Gary Bennett, a pain researcher at 
the University of California in San Diego. 
“We know that it’s real. We don’t know how 
important it is, and it’s really, really hard 
to answer that question, so let’s move on.” 


MAO ISN’T READY to move on. He believes 
the risk of hyperalgesia should motivate doc- 
tors to try tapering patients off their opioids 
when their pain worsens without an obvi- 
ous cause. But in his experience, only about 
a third of chronic pain patients are willing 
to try that. So he’s hoping for a different so- 
lution: a drug that targets the mechanisms 
behind hyperalgesia and that might be given 
alongside an opioid, either when it’s first pre- 
scribed or when a doctor suspects OIH. 

Mao is recruiting patients for clinical trials 
to test two candidate drugs. One is ketamine, 
an anesthetic that blocks NMDA receptors.
The other, guanfacine, is currently used to 
treat high blood pressure and is thought to 
keep sensory neurons from releasing glu- 
tamate into the spinal cord. A team led by 
Peggy Compton of Georgetown University in 
Washington, D.C., meanwhile, is investigat- 
ing a pain and antiseizure drug called gaba- 
pentin that may block neural transmission 
to reduce excessive pain signals. 

Other groups are attacking opioid side ef- 
fects, including hyperalgesia, from a very dif- 
ferent angle. In the early 2000s, researchers 
began exploring the role of glia, star-shaped 
immune cells in the brain and spinal cord, 
which were traditionally thought to function 
as mere “housekeepers,” offering structural
support for neurons and removing debris.
But when the immune system becomes activated
in response to an illness or injury, glia
in regions associated with pain processing
seem to take on another role: They release
inflammatory molecules that interact with
nearby neurons to amplify pain signals.

Turning down the volume

Animal studies have revealed several ways in which opioids may amplify
pain signals in the central nervous system, suggesting targets for drugs
that could counter the effect.

In 2001, researchers at the Chinese Acad- 
emy of Sciences in Shanghai reported that 
chronic morphine administration in rats 
activated glial cells called astrocytes in the 
spinal cord. Subsequent studies showed 
that inhibiting the inflammatory molecules 
released by glia could reverse hyperalgesia 
and tolerance in the rats. The results sug- 
gested that opioids may trigger glia to set 
off system-wide pain signaling that both 
counteracts the pain relief from the drug 
and makes the body generally more sensitive 
to pain. 
Many see dampening this inflammatory
response as a promising way to fight hyper- 
algesia, because it would not interfere with 
opioids’ pain-relieving activity on neural 
receptors. Several efforts are underway. 
The San Diego, California-based biotech 
company MediciNova recently completed 
a phase II trial of a glia-inhibiting drug 
called ibudilast, already approved as an 
asthma treatment in Japan, to relieve pain 
and treat withdrawal in opioid abusers. A 
study led by researchers at Yale Univer- 
sity is testing the antibiotic acne medica- 
tion minocycline, which is also thought 
to block glial activation in the brain. 
And research spun out of neuroscientist 
Linda Watkins’s group at the University 
of Colorado in Boulder is testing a new
pain drug that may tame glia in the spi- 
nal cord by blocking a signaling protein on 
their surface. 

If inflammation turns out to be a key 
driver of OIH, it might also point the way to 
a better test for the effect, says Lesley Colvin, 
a pain researcher at the University of Edin- 
burgh. Markers of inflammation in the blood 
might correlate with clinical signs of hyper- 
algesia or declining pain thresholds on 
sensory tests. Colvin says she already 
sees strong evidence of hyperalgesia in 
high-dose opioid users at the clinic where 
she works. With so much at stake, she is 
eager to understand the phenomenon 
and how it might affect them long term. 
“Although it’s complicated,” she says, “that 
doesn’t mean we shouldn’t try and work out 
the details.”


Pain regulation by non-neuronal
cells and inflammation 


Ru-Rong Ji, Alexander Chamessian, Yu-Qiu Zhang


Acute pain is protective and a cardinal feature of inflammation. Chronic pain after arthritis, 
nerve injury, cancer, and chemotherapy is associated with chronic neuroinflammation, a local 
inflammation in the peripheral or central nervous system. Accumulating evidence suggests 
that non-neuronal cells such as immune cells, glial cells, keratinocytes, cancer cells, and stem 
cells play active roles in the pathogenesis and resolution of pain. We review how non-neuronal 
cells interact with nociceptive neurons by secreting neuroactive signaling molecules that 
modulate pain. Recent studies also suggest that bacterial infections regulate pain through 
direct actions on sensory neurons, and specific receptors are present in nociceptors to detect 
danger signals from infections. We also discuss new therapeutic strategies to control 
neuroinflammation for the prevention and treatment of chronic pain. 


Clinically, inflammation is characterized
by five cardinal signs: rubor (redness),
calor (increased heat), tumor (swelling),
dolor (pain), and functio laesa (loss of
function). Acute inflammation is a protective
response involving immune cells, blood
vessels, and molecular mediators (inflamma- 
tory mediators). The function of inflammation 
is to eliminate the initial cause of cell injury 
and initiate tissue repair. Acute pain, also known 
as nociceptive pain, is a cardinal feature of in- 
flammation. The majority of known inflamma- 
tory mediators cause pain by binding to their 
receptors on nociceptive primary sensory neu- 
rons in the peripheral nervous system (PNS) 
(nociceptors) that innervate injured skin, muscle,
and joint tissues (Fig. 1) (1-3). Once thought
to be a passive process, the resolution of acute 
inflammation is now recognized as a distinct, 
active process involving specialized pro-resolution 
mediators (SPMs) such as resolvins, protectins, 
and maresins, derived from omega-3 unsaturated 
fatty acids (2, 4), as well as other pro-resolution 
mechanisms (5). Resolvins not only regulate the 
resolution of acute inflammation but also inhibit 
inflammatory pain through direct actions on no- 
ciceptors through specific receptors; activation 
of ChemR23 by resolvin E1 potently inhibits transient
receptor potential ion channel V1 (TRPV1),
a key ion channel for pain transduction (4). 

In contrast to acute inflammation, chronic 
inflammation is often detrimental, leading to a 
host of diseases such as periodontitis, athero- 
sclerosis, rheumatoid arthritis, and even cancer 
(2). It is unclear whether chronic inflammation 
is also as critical for driving chronic pain as acute 
inflammation is for acute pain. Pain research in 
the past several decades has established that 
neuronal plasticity is a key mechanism for the
development and maintenance of chronic pain 
(1, 6). Peripheral sensitization in nociceptors is
essential for the development of chronic pain 
(3) and transition from acute pain to chronic
pain (7). Central sensitization (enhanced responses
of pain circuits in the spinal cord and brain)
regulates the chronicity of pain, causes the spread
of pain beyond the site of injury, and influences
the emotional and affective aspects of pain (8).
Neuroinflammation is a localized inflammation
occurring in the PNS and central nervous system
(CNS) in response to trauma, neurodegeneration,
bacterial or viral infection, autoimmunity,
and toxins (2, 9). The hallmarks of neuroinflammation
are activation and infiltration of leukocytes,
activation of glial cells, and increased
production of inflammatory mediators. Neuroinflammation
is also associated with changes of
vascular cells that facilitate leukocyte infiltration
(2, 9). Compared with inflammation, neuroinflammation
is more persistent in chronic pain
conditions and therefore plays a more important
role in chronic pain maintenance (2). For example,
fibromyalgia, a widespread chronic pain syndrome,
is associated with small fiber neuropathy
and neuroinflammation, although its correlation
with systemic inflammation is unclear (10).

Fig. 1. Interactions between non-neuronal cells, neurons, and
inflammation/neuroinflammation in different pain conditions
after injury and insult. Non-neuronal cells can modulate pain
in different directions by producing either pro- or anti-nociceptive
mediators.

The interactions between inflammation and 
pain are bidirectional (Fig. 1). Nociceptive sen- 
sory neurons not only respond to immune signals 
but also directly modulate inflammation. For ex- 
ample, nociceptors express receptors for and re- 
spond to cytokines and chemokines and also 
produce these inflammatory mediators (11, 12). In 
a process called neurogenic inflammation, nox- 
ious stimulation causes nociceptors to release 
neuropeptides such as Substance P (SP) and cal- 
citonin gene-related peptide (CGRP), leading to 
the extravasation of fluid and cells from the blood. 
Consistently, silencing nociceptors reduces allergic 
airway inflammation (13). Nociceptors
also serve to dampen and
constrain the immune response:
Ablation of nociceptors abrogated
pain during bacterial infection but
concurrently worsened inflammation
via CGRP (14). Activation of
pain circuits also regulates neuro- 
inflammation in the CNS, referred 
to as neurogenic neuroinflamma- 
tion in chronic pain and neuro- 
degenerative diseases (9). 

Numerous non-neuronal cell 
types influence pain sensation, 
including immune, glial, epithe- 
lial, mesenchymal, cancer, and 
bacterial cells. In this Review, we 
focus on non-neuronal cells that 
interact with nociceptors in dis- 
tinct anatomical compartments 
in the PNS and CNS (glial cells) 
under normal and pathological 
conditions (Fig. 2). Despite the 
diversity of these cells, the ways 
in which they modulate pain 
are surprisingly consistent. In 
response to an injury or insult, 
non-neuronal cells release neu- 
romodulatory substances in close 
proximity to nociceptors, which 
either promote or dampen pain 
depending on the specific identities of the me- 
diators involved (Figs. 1 and 2). Although the de- 
tails will differ in each case, this general model 
will serve as a useful lens through which to 
examine specific non-neuronal cell types in pain. 


Pain modulation by non-neuronal cells 
Monocytes and macrophages 


Monocytes and macrophages serve three main 
functions in the immune system: phagocytosis, 
antigen presentation, and cytokine production.
Monocytes and macrophages in the periphery 
play an active role in pain, exhibiting diverse 
mechanisms that are shaped by the causes and 
context of pain. In most cases, these cells produce 
pain through the release of proinflammatory 
mediators such as tumor necrosis factor (TNF) 
and interleukin-1β (IL-1β) (11), resulting in enhanced
pain transduction and conduction via
modulation of ion channels such as transient
receptor potential ankyrin 1 (TRPA1), TRPV1,
and Nav1.7-1.9 (1, 2). Cell-specific depletion of
proliferating monocytes and macrophages im- 
pairs the development of mechanical and ther- 
mal hypersensitivity caused by sterile incision 
and pathogens, in parallel with a decrement 
in IL-1β and other pro-algesic mediators at the
site of inflammation (15). However, in a nerve 
injury model, deletion of peripheral monocytes 
does not affect neuropathic pain development 
(16). In a model of chemotherapy-induced neuropathic
pain, CX3CR1+ monocytes migrate into
peripheral nerves and produce reactive oxygen
species, which then elicit pain via the activation
of TRPA1 (17). After peripheral nerve
injury, infiltration of monocytes and macro- 
phages to the spinal cord is limited, arguing 
against a central role of these cells in neuropathic
pain (6, 18). Conversely, monocytes and
macrophages can also effect analgesia by re- 
leasing anti-inflammatory mediators such as 
IL-10 and SPMs, together promoting the res- 
olution of the initial insult (2, 4). In support of 
this view, depletion of monocytes and ma- 
crophages delayed the resolution of inflam- 
matory pain (19). Macrophages have different 
phenotypes related to their functional states, 
including proinflammatory M1-like and anti- 
inflammatory M2-like phenotypes, which may 
play distinct roles in the induction and resolu- 
tion of pain. 


T lymphocytes 

T cells are critical elements of adaptive immu- 
nity that have also been implicated in pain, 
with the most evidence for a role in neuropathic 
pain. After nerve injury, T cells infiltrate the 
dorsal root ganglia (DRG) and release the pro- 
algesic mediator leukocyte elastase (LE), re- 
sulting in mechanical allodynia. Consistently, 
an inhibitor of LE reduces nerve injury-induced
allodynia (20). In the spinal cord, T cell infiltration
occurs after nerve injury and is required
for the development of mechanical hypersensitivity,
as indicated by a reduction of pain behaviors
in T cell-deficient Rag1-null mice (21).
A recent report supported the role of spinal
T cells in nerve injury-induced neuropathic 
pain but limited this function only to female 
mice, with male mice instead depending on 
microglial signaling for pain. This intriguing 
finding was attributed to the sexually dimorphic
expression of peroxisome proliferator-activated
receptors (PPARs) α and γ in T cells
(22). Different types of T cells play different 
roles in chronic pain. For example, adoptive 
transfer of cytotoxic T cells (CD8+ T cells) via intrathecal
injection enhances neuropathic pain,
whereas injection of regulatory T cells (Treg 
cells) decreases neuropathic pain after chemo- 
therapy (23). 


Keratinocytes 


Keratinocytes are the primary cells of the epi- 
dermis. They reside near the peripheral terminals 
of nociceptors and produce various neuroactive 
mediators such as adenosine 5’-triphosphate 
(ATP), IL-1β, prostaglandin E2, endothelin, and
nerve growth factor (NGF) that are known to 
elicit pain, suggesting that these cells can di- 
rectly activate nociceptors. Photostimulation 
of channelrhodopsin-expressing keratinocytes 
is sufficient to generate nocifensive behaviors 
and evoke action potentials in specific subsets 
of cutaneous nociceptive afferents (24). Keratinocytes
are multifunctional, promoting pain
or analgesia depending on the condition and
context. The familiar experience of sunburn illustrates
the dual nature of these cells. Before
the pain of sunburn sets in, the feeling of sun
on the skin is a pleasurable experience. Here,
keratinocytes participate by releasing the endogenous
opioid peptide β-endorphin (β-END),
which produces both analgesia and reward
(25). After light overexposure, when inflammation
and injury occur, keratinocytes also
elicit pain via the activation of TRPV4 and the
subsequent release of endothelin onto cutaneous
nociceptors (26). Although keratinocytes
are not immune cells, they can release inflammatory
mediators including cytokines (27), illustrating
how nonimmune cells can also regulate
inflammation.

Fig. 2. Interactions between distinct parts of a nociceptor with different
types of non-neuronal cells. Included are keratinocytes, Schwann
cells, satellite glial cells, oligodendrocytes, and astrocytes, as well as immune
cells (for example, macrophages and T cells), microglia, cancer cells,
and stem cells. These non-neuronal cells produce both pro-nociceptive
(highlighted in red) and anti-nociceptive (highlighted in blue) mediators,
which can bind their respective receptors on the nociceptor to modulate
its sensitivity and excitability. The central terminal of the nociceptor
forms a nociceptive synapse with a postsynaptic neuron in the spinal
cord dorsal horn to mediate pain transmission in the CNS. END, endorphin;
ET, endothelin; Glu, glutamate; IFN-α, interferon-α; LE, leukocyte
elastase.

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Fig. 3. Neuron-glial interactions in the spinal cord for the amplification of chronic pain. Painful injuries such 
as nerve injury, arthritis, cancer, and treatment (chemotherapy) cause hyperactivity of nociceptors and secretion of 
glial modulators from their central terminals, leading to the activation of microglia and astrocytes in the spinal cord 
dorsal horn. Upon activation, microglia and astrocytes secrete neuromodulators to drive chronic pain by inducing 
synaptic and neuronal plasticity. Pre- and postsynaptic neurons can both “listen” and “talk” to microglia and astrocytes. 
CASP6, caspase-6. 


Central glial cells: Microglia, astrocytes, 
and oligodendrocytes 

Microglia serve as the resident macrophages of 
the spinal cord and brain, and abundant evi- 
dence supports their role in pathological pain 
(28, 29). A hallmark of microglia is their rapid 
activation in response to even minor patholog- 
ical changes in the CNS. Peripheral nerve injury 
induces a marked proliferation and activation 
of microglia in parallel with up-regulation of 
microglial markers such as IBA1 and CD11b in 
the spinal cord. Indeed, spinal microglia have 
been strongly implicated in the pathogenesis of 
neuropathic pain after nerve injury (18, 29-32). 
The signals that activate these microglia include 
ATP, colony-stimulating factor-1 (CSF1), chemokines 
(CCL2 and CX3CL1), and proteases, which 
can originate from injured or activated sensory 
neurons (Fig. 3). In parallel, expression of 
the receptors for ATP and CX3CL1 (P2X4, P2X7, 
P2Y12, and CX3CR1) is increased selectively on 
spinal microglia in response to nerve injury 
(28, 29). Activation of these receptors typically 
converges on an intracellular signaling cascade 
involving the phosphorylation of p38 mitogen- 
activated protein (MAP) kinase, which leads to 
the increased production and release of TNF-α, 
IL-1β, IL-18, and brain-derived neurotrophic factor 
(BDNF), as well as increased expression of cyclooxygenase 
(COX) and subsequent synthesis of 
prostaglandin E2 (6). These neuromodulators 
can then fine-tune both excitatory and inhib- 
itory synaptic transmission, which ultimately 
enhances pain signal transmission to the brain. 
For example, TNF and IL-1β enhance excitatory 
synaptic transmission and suppress inhibitory 
synaptic transmission in spinal cord 
lamina II neurons (33). BDNF disrupts chloride 
homeostasis and suppresses γ-aminobutyric acid 
(GABA)-mediated inhibitory synaptic trans- 
mission (disinhibition) in spinal lamina I pro- 
jection neurons (6, 34). Selective activation of 
spinal microglia by fractalkine (CX3CL1) is suf- 
ficient to rapidly facilitate synaptic strength 
between primary afferent C-fibers and lamina 
I projection neurons (35). Caspase-6 is only ex- 
pressed in axonal terminals in the spinal cord 
and mediates neuron-microglial interaction. Recombinant 
caspase-6 triggers marked TNF release 
from microglia and drives robust acute 
inflammatory pain behaviors before the morphological 
activation of microglia (36). Spinal 
microglial signaling is sex-dependent, and microglial 
signaling inhibitors such as minocycline 
and p38 inhibitors reduce neuropathic pain 
primarily in male mice, with little to no effect 
on female mice (22). Although microglia are 
critical to the development of chronic pain, they 
may function in the maintenance of chronic 
pain as well. For example, examination of mi- 
croglial enhancers by using chromatin immu- 
noprecipitation sequencing (ChIP-seq) revealed 
persistent alterations in close proximity to trans- 
criptionally regulated genes, leading to the hy- 
pothesis that these changes may contribute to 
the “memory” recorded at a molecular level 
(18). Microglia are also involved in opioid-induced 
hyperalgesia through microglia-mediated dis- 
ruption of neuronal chloride 
homeostasis (34). Morphine par- 
adoxically prolongs neuropathic 
pain in rats, and selective in- 
hibition of spinal cord microglia 
in vivo through DREADD (De- 
signer Receptor Exclusively Acti- 
vated by Designer Drugs) reverses 
morphine-induced pain sensitiza- 
tion for weeks (37). In the future, 
it will also be important to de- 
termine whether microglia are 
involved in the resolution of 
chronic pain, which would be ex- 
pected on the basis of the dual 
role of their peripheral macro- 
phage counterparts (19). 
Astrocytes perform numerous 
critical functions such as neuro- 
transmitter recycling, formation 
of the blood-brain barrier, regula- 
tion of extracellular ion concentra- 
tion, and modulation of synaptic 
transmission, among many others. 
Nerve injury induces myriad changes 
in astrocytes that lead to enhanced 
pain. For example, after nerve in- 
jury, astrocytes lose their ability 
to maintain the homeostatic concentrations 
of extracellular potassium 
(K⁺) and glutamate, leading 
to neuronal hyperexcitability (28). 
Astrocytes can also signal directly 
to neurons through physically coupled networks 
mediated by gap junctions to facilitate inter- 
cellular transmission. Gap junction communi- 
cation is mediated by connexin-43 (Cx43), the 
predominant connexin expressed in astrocytes. 
Nerve injury induces persistent up-regulation 
of Cx43 in astrocytes and switches the function 
of Cx43 from gap junction communication to 
paracrine modulation (38). This paracrine regulation 
leads to the increased release of glutamate, 
ATP, and chemokines. 
The astrocyte-derived chemokines 
act as neuromodulators and can potentiate ex- 
citatory synaptic transmission in the spinal cord 
pain circuitry (38). Furthermore, nerve injury 
up-regulates CXCL13 in spinal cord neurons, 
which can activate astrocytes via CXCR5 to maintain 
neuropathic pain (39). Thus, chemokines 
facilitate neuropathic pain via bidirectional 
neuron-astrocyte interactions (Fig. 3). Nerve 
injury also induces spinal cord and cortical 
astrocytes to up-regulate thrombospondin-4 
(TSP4), which promotes neuropathic pain through 
the formation of new synapses and rewiring 
of somatosensory cortical circuits (40, 41). A 
single human astrocyte may contact more than 
1 million synapses, and such complexity points 
to a more important role of astrocytes in humans 
that bears further investigation. As compared 
with microglial activation, astrocyte activation 
in chronic pain conditions is more persistent, 
indicating their contribution to the chronicity 
of pain (28). 

Oligodendrocytes create the myelin sheath 
that provides support and insulation to axons in 
the CNS (Fig. 2). In spite of their ubiquity and 
importance, only recently has a role for oligo- 
dendrocytes been revealed in pain. In the chronic 
constriction injury model of nerve injury-induced 
neuropathic pain, oligodendrocyte-derived IL-33 
contributes to pain hypersensitivity via MAP kinases 
and nuclear factor κB (NF-κB) (42). Similarly, 
in post mortem spinal cord samples from 
HIV patients, expression of oligodendrocyte markers 
such as NG2, Olig2, and platelet-derived 
growth factor receptor α (PDGFRα) increases, 
reflecting a persistent activation of oligoden- 
drocytes in chronic pain (43). Conversely, toxin- 
mediated ablation of oligodendrocytes induces 
neuropathic pain symptoms, suggesting a po- 
tential protective role of these cells (44). These 
divergent findings suggest that as with other 
non-neuronal cells, oligodendrocytes play active 
and context-specific roles in pain. 


Peripheral glia: Schwann cells 
and satellite glial cells 


Like their central counterparts, the major glial 
cells of the PNS, the Schwann cells and the 
satellite glial cells (SGCs), contribute to pain. In 
response to painful stimuli, these peripheral 
glial cells are activated before central glia and 
release various inflammatory mediators, sen- 
sitizing nociceptors at axons (Schwann cells) 
and cell bodies (SGCs). After nerve injury, ac- 
tivated Schwann cells mediate the breakdown 
of the blood-nerve barrier via the secretion of 
matrix metalloproteinase 9 (MMP-9), which 
promotes the recruitment of immune cells from 
the vasculature and their subsequent release 
of more pro-nociceptive mediators (27). SGCs 
surround the somata of DRG neurons and are 
directly coupled to each other via gap junctions. 
After nerve injury, SGCs become activated and 
proliferate (27). SGCs contribute to chronic pain 
sensitization by producing cytokines and MMPs 
that regulate the cleavage and activation of cy- 
tokines (28). Nociceptive activity also causes ATP 
release from neuronal soma to activate P2X7 in 
SGCs, leading to TNF release from SGCs and sub- 
sequent increase in neuronal excitability (45). 
Activation of P2X7 receptors in SGCs also reduces 
pain through down-regulation of P2X3 receptors 
in nociceptive neurons (46). 


Stem cells 


Bone marrow stromal cells or bone marrow 
stem cells (BMSCs) produce many beneficial 
effects to promote tissue regeneration and tis- 
sue repair by secreting growth factors. These 
cells can also effectively control inflammation 
and neuroinflammation by secreting anti- 
inflammatory mediators. The duration of pain 
relief by BMSCs is remarkable after systemic or 
local injection (47, 48). A single intrathecal in- 
jection of BMSCs inhibits nerve injury-induced 
neuropathic pain for many weeks via secretion 
of transforming growth factor-β1 (TGF-β1), a 
potent anti-inflammatory cytokine (48). The 
analgesic effects of intrathecal BMSCs were 
abolished by a neutralizing antibody against 
TGF-β1. Intrathecal BMSCs effectively suppress 
nerve injury-induced glial activation and neuro- 
inflammation in DRGs and spinal cord. After 
intrathecal injection, BMSCs migrate to DRG 
via a chemotactic signal (CXCL12) triggered by 
nerve injury (48). This paracrine modulation 
of pain by BMSCs is very different from other 
stem cell strategies for chronic pain manage- 
ment. For example, implantation of forebrain 
GABAergic precursor cells into the spinal cord 
reduced nerve injury-induced neuropathic pain 
because these precursor cells can differentiate 
into functional GABAergic neurons in the spinal 
cord (49). 


Cancer cells 


Cancer pain is a complex pain state, involving 
inflammatory, neuropathic, compressive, and 
ischemic mechanisms. Cancer pain is commonly 
caused by tumors that metastasize from dis- 
tant sites to the bone and by tumors infiltrating 
the nerve. Cancers generate and secrete algo- 
genic mediators (such as protons, bradykinin, 
endothelins, prostaglandins, and proteases) that 
sensitize and activate nociceptors in the can- 
cer microenvironment (50). In particular, can- 
cer cells secrete NGF and vascular endothelial 
growth factor (VEGF), which induce hyper- 
innervation of pain-mediating nerve fibers in 
cancer tissues (50, 51). VEGF promotes cancer 
pain through a nonvascular regulation. VEGF 
receptor 1 (VEGFR1) is expressed in nocicep- 
tors, and tumor-derived VEGF family members 
increase nociceptor excitability and produce 
pain hypersensitivity through selective activation 
of VEGFR1 (51). Certain cancer cells, such 


as Walker-256 rat mammary gland carcinoma 
cells, also secrete TGF-β. In early stages of cancer, 
TGF-β acts as a tumor suppressor through 
antiproliferative and proapoptotic actions. 
During tumor progression, TGF-β becomes an 
oncogenic factor and causes angiogenesis, immunosuppression, 
and proliferation, invasion, 
and metastasis of cancers. TGF-β1 also promotes 
bone cancer pain (52). Thus, this anti-inflammatory 
cytokine might have different roles in different 
pain conditions via different signaling mecha- 
nisms. Microglia also play an active and distinct 
role in bone cancer pain. Spinal microglia con- 
tribute to the maintenance of bone cancer pain 
in female rats (53). Bone cancer causes increased 
ATP secretion in cerebrospinal fluid and up- 
regulation of P2X7 receptor in spinal microglia. 
Activation of P2X7 results in increased syn- 
thesis and release of IL-18 via p38 MAP kinase. 
Spinal inhibition of the P2X7/p-38/IL-18 path- 
way reduces the advanced phase of bone cancer 
pain and suppresses hyperactivity of spinal wide 
dynamic-range neurons (53). This result suggests 
that spinal cord microglia have an active 
role in late-phase bone cancer pain in fe- 
male rats. Further investigation is warranted 
to test sex-dependent regulation of microglial 
signaling in different animal models and also 
in different phases of chronic pain. Some can- 
cers such as melanoma are not painful in the 
majority of patients. It is possible that mela- 
noma may produce some analgesic molecules 
such as β-endorphin to suppress pain (Fig. 2), 
which can promote the optimal growth of can- 
cers, dampening the warning function of pain. 


Pain modulation by bacterial 
infections through direct interactions 
with sensory neurons 


Infections caused by bacterial, fungal, and viral 
pathogens are commonly associated with pain. 
Until recently, it was generally believed that in- 
fections cause pain in an indirect manner through 
the intermediating effects of immune cells and 
the inflammatory substances they secrete (Fig. 4). 
However, emerging evidence suggests that path- 
ogens can also directly activate nociceptors and 
elicit pain using a suite of previously unknown 
and intriguing mechanisms (14). For example, 
the bacterium Staphylococcus aureus possesses 
a pore-forming toxin called α-hemolysin, which 
enables it to destroy host cells and also to cause 
pain by directly forming pores in nociceptors, 
permitting cation influx and action potential 
firing through the metallopeptidase a disintegrin 
and metalloproteinase domain-containing protein 
10 (ADAM10) (14). Lipopolysaccharide (LPS) 
is the best-studied element derived from bacte- 
rial cell walls and is also known to produce pain 
hypersensitivity by sensitizing TRPV1 in noci- 
ceptors (54). Bacteria also produce pain by se- 
creting substances that are ligands for receptors 
on nociceptors, as is the case with N-formyl pep- 
tides, which are metabolic by-products of bacte- 
ria that activate nociceptors and elicit pain by 
binding to the formyl peptide receptor 1 (FPR1), 
a G protein-coupled receptor. The pain produced 
through these mechanisms is independent of 
the immune response (14). Bacterial infection 
is not always painful, however. Mycobacterium 
ulcerans, the etiological agent of Buruli ulcer, 
causes extensive skin lesions but not pain. 
M. ulcerans produces remarkable analgesia, 
through a specific activation of sensory neurons 
triggered by the secreted mycobacterial poly- 
ketide mycolactone. Mycolactone elicits anal- 
gesia through activation of type 2 angiotensin II 
receptors (AT2Rs), leading to the activation of 
K2P potassium channels TRAAK (KCNK4) and 
hyperpolarization of nociceptors (55). 

Although viruses are not cells, viral infections 
are frequently associated with inflammation 
and pain, such as in sore throat. Several types of 
viruses—such as herpes simplex virus 1 (HSV-1), 
HSV-2, and VZV (varicella zoster virus)—can infect 
sensory neurons to produce painful syndromes. 
Shingles, also called herpes zoster, is a painful skin 
rash and usually appears in a defined region of the 
face or body that corresponds to dermatomes. 
It is most common in older adults and people 
who have weak immune systems. Viral infections 
may activate Toll-like receptors (TLRs) in nociceptors 
to modulate pain and neuroinflammation. 

[Fig. 4 labels: Bacteria, tissue injury; Macrophages; Astrocytes; Microglia; TLR2, TLR4; Spinal cord; TNF, IL-1, IL-18; CCL2, CXCL1; Sensory neuron terminals; Synaptic plasticity; Pain] 

Fig. 4. Infections and tissue injury regulate pain via both neuronal and non-neuronal mechanisms through TLRs. Bacterial infections can 
modulate pain through direct interactions with specific receptors such as FPR and TLR on primary sensory neurons, including nociceptors and 
mechanoceptors, leading to increased and decreased pain sensitivity. TLRs are expressed by neurons, glial cells, and immune cells and are activated by PAMP 
(infections), as well as DAMP, endogenous TLR ligands (such as HMGB1 or miRNAs) released after tissue injury. HMGB1, high mobility group box 1 protein; 
ssRNA, single-stranded RNA. 


Toll-like receptors mediate 
bilateral neuron-immune and 
neuron-glial interactions 


Mounting evidence suggests that TLRs are instrumental 
in bilateral interactions between 
neurons and non-neuronal cells (Fig. 4). TLRs 
are typically expressed by immune and glial cells 
to regulate innate immunity through the activation 
of the NF-κB and MAP kinase signaling 
pathways. TLRs are activated not only by PAMP 
(pathogen-associated molecular patterns) but 
also by DAMP (danger-associated molecular patterns), 
the endogenous ligands released after 
tissue injury. Activation of TLR4 in spinal microglia 
and astrocytes critically contributes 
to the development and maintenance of inflammatory 
and neuropathic pain, as well as opioid-induced 
hyperalgesia, by eliciting the synthesis 
of TNF-α and IL-1β (37, 56, 57). These secreted 
cytokines interact with nociceptive neurons in 
the spinal cord pain circuitry (Fig. 4). 

Primary sensory neurons in the DRG and trigeminal 
ganglia also express TLRs (54, 58-60). 
TLRs are differentially expressed in primary sensory 
neurons and modulate different sensory 
functions. TLR4 appears to be functionally coupled 
with TRPV1 in nociceptors, so that LPS binding 
to TLR4 increased Ca²⁺ influx in sensory 
neurons (54, 60). In contrast to its intracellular 
localization in immune cells, TLR7 is expressed 
on the cell surface as well as in axons of DRG 
nociceptors. Activation of TLR7 by synthetic 
ligands causes immediate activation and excitation 
of nociceptors (61). MicroRNA (miRNA) 
let-7b is an endogenous ligand of TLR7 and 
activates TLR7 through the GUUGUGU motif. 
Let-7b causes rapid activation and excitation 
of nociceptors through the coupling of TLR7 
and TRPA1. Consistently, intraplantar injection 
of let-7b elicits rapid spontaneous pain through 
TLR7 and TRPA1 (58). 

TLR5 is expressed by large-sized A-fibers, especially 
Aβ fibers that normally mediate touch 
sensation. The canonical ligand for TLR5 is flagellin, 
a globular protein that forms the filament 
in bacterial flagella. In large A-fibers, activation 
of TLR5 by flagellin permits the rapid transport 
of certain small molecules across the 
neuronal membrane through a to-be-determined 
mechanism. This Aβ-fiber-specific transport 
process was exploited to deliver the membrane-impermeable 
anesthetic QX-314, resulting in 
selective blockade of A-fibers. Silencing A-fibers 
in this manner revealed that these neurons 
inhibit pain under normal conditions but give 
rise to mechanical allodynia in chemotherapy-induced 
neuropathy (59), in support of the 
“Gate Control Theory.” Further studies are warranted 
to investigate the ion channels and signaling 
molecules that give rise to this previously 
unknown function of TLR5. 

Whereas coupling of TLRs with ion channels 
in nociceptors is important for the induction of 
acute pain, activation of conventional TLR signaling 
in nociceptors through myeloid differentiation 
factor 88 (MyD88), a key downstream 
signaling molecule for TLRs, is required for the 
development of chronic pain after painful insults 
(23, 58). Deletion of MyD88 in nociceptors 
results in deficits in neuropathic pain as well 
as in innate and adaptive immunity in the PNS 
after chemotherapy-induced neuropathy (23). 
Thus, nociceptors share similar features with 
immune cells by expressing TLRs, cytokines, 
chemokines, and inflammatory cascades (62). 
Through the coupling with ion channels, nociceptor 
TLRs act as a distinct class of “danger 
receptors” for the rapid detection of PAMP and 
DAMP. Thus, primary sensory neurons not only 
sense pain, itch, temperature, and touch (63) but 
also sense “danger” (DAMP and PAMP, which 
partially overlap with algogenic and pruritogenic 
agents), and this danger sensation 
will trigger protective responses such as a rapid 
withdrawal or scratching response (61) as well 
as delayed innate and adaptive immunity responses 
through MyD88 signaling in sensory 
neurons (23). This evolutionarily conserved signaling 
pathway persists because of its critical 
biological function, but excessive activity of the 
same pathway can also lead to autoimmune 
diseases if left unchecked. 


Concluding remarks 
and future directions 


Chronic pain is a rising health concern world- 
wide, affecting up to 30% of adults. Despite this 
enormous burden, the current treatments for 
chronic pain are inadequate. Chronic pain typ- 
ically includes inflammatory pain after tissue 
injury, cancer pain, and neuropathic pain after 
nerve injury as a result of diabetic neuropathy, 
viral infection, or major surgeries. Common treat- 
ments, such as chemotherapy and chronic opioid 
treatment, also cause chronic pain. In the past 
decade, great progress has been made to demon- 
strate the critical roles of glial cells and immune 
cells—as well as keratinocytes, cancer cells, and 
stem cells—in the pathogenesis and resolution 
of chronic pain. These non-neuronal cells can 
communicate with nociceptive neurons by “lis- 
tening” and “talking” to neurons, modifying their 
activities by secreting neuroactive signaling mole- 
cules that can be both pro- and anti-nociceptive 
(Figs. 1 to 3). There is increasing appreciation 
that many inflammatory mediators such as cyto- 
kines and chemokines are indeed neuromodu- 
lators in the PNS and CNS (2). Thus, chronic 
pain could be caused by both “neuropathy” and 
“gliopathy” (28), as well as dysregulations of other 
non-neuronal cells (Fig. 2). We extend the bound- 
ary of non-neuronal cells to include immune 
cells, cancer cells, and stem cells because they 
can infiltrate damaged or normal tissues to in- 
teract with nociceptors and also produce neu- 
roactive modulators to alter pain states. Given 
the close link between inflammation and pain, 
we discussed non-neuronal cells in the context 
of inflammation and especially neuroinflamma- 
tion (2) that is tightly associated with chronic 
pain. Inflammation and neuroinflammation can 
be modulated not only by immune cells but also 
by other non-neuronal cells such as keratinocytes, 
stem cells, and cancer cells and even by noci- 
ceptive neurons in the PNS and CNS. Infections 
by bacteria and other pathogens not only affect 
immune and glial cells but also have a direct im- 
pact on sensory neurons, eliciting rapid nocicep- 
tive responses through activation of neuronal 
receptors such as FPR1 and TLRs. In particular, 
TLRs in immune or glial cells and sensory neurons 
not only sense the danger signals from tissue injury 
and infections but also regulate bilateral interactions 
between neurons and non-neuronal 
cells, and these interactions are important for 
the induction and chronicity of inflammation 
and pain. 

Although several types of non-neuronal cells 
contribute to the pathogenesis of pain, deletion 
of a cell type entirely could be detrimental. We 
should bear in mind that these non-neuronal 
cells, except for cancer cells, are protective under 
normal conditions but dysregulated in pathological 
conditions. Although blocking their function 
may provide short-term pain relief, restoring 
their normal function through pro-resolution 
approaches is probably the best strategy. On the 
basis of this principle, we propose the following 
therapeutic strategies as future directions. First, 
different pharmaceutical strategies should be 
considered at different development stages of 
pain and inflammation. Anti-inflammatory drugs 
such as inhibitors of TNF are more effective in 
the early stage of pain and inflammation. Res- 
olution of inflammation is a new therapeutic 
frontier (5), and SPMs such as resolvins and 
protectins may promote the resolution of in- 
flammation and pain, which can be partially 
achieved via dietary control (2). In particular, 
neuroprotectin D1 is effective in controlling 
neuroinflammation and neuropathic pain (64). 
Second, cell therapies such as implantation of 
BMSCs are promising, demonstrating long-term 
pain relief and even final resolution of pain 
(47, 48). However, only a very small percentage 
of cells in bone marrow are stem cells for autol- 
ogous transplantation. Instead, macrophages are 
an abundant cell source in blood, and implan- 
tation of resident and M2-like macrophages 
could be an alternative to promote the resolution 
of inflammation and pain. Alternatively, induced 
pluripotent stem cells (iPSCs) from abundant 
cells (such as skin or blood) could be transformed 
into the desired cell type and then transplanted. 
Moreover, transcription activator-like effector nu- 
clease (TALEN)- or clustered regularly interspaced 
short palindromic repeats (CRISPR)-mediated 
gene editing ex vivo could first be used to enhance 
the production of pro-resolution mediators and 
the migration capability of the cells to be trans- 
planted so that they can target the damaged tis- 
sue more accurately. Last, neuromodulation such 
as vagus nerve stimulation and acupuncture has 
been used for inflammation control and pain 
treatment (65, 66). In addition to local acute 
analgesia via adenosine from non-neuronal cells 
(67), acupuncture and sciatic nerve activation 
also regulate inflammation through vagus nerve 
activation and dopamine release (68). It will be 
of great interest to test whether neuromodula- 
tion can also help to restore the normal function 
of non-neuronal cells in chronic pain as a non-pharmaceutical 
approach. 


REVIEW 


Neural circuits for pain: 
Recent advances and current views 


Cedric Peirs and Rebecca P. Seal 


The mammalian nervous system encodes many different forms of pain, from those 
that arise as a result of short-term low-grade interactions with noxious thermal, chemical, 
or mechanical sources to more serious forms of pain induced by trauma and disease. In 
this Review, we highlight recent advances in our understanding of the neural circuits that 
encode these types of pain. Promising therapeutic strategies based on recent advances 
are also highlighted. 


The ability to sense pain protects us from 
harm and is thus an essential aspect of 
our well-being. Patients suffering from 
channelopathies that eliminate the ability 
to feel pain have very high rates of early 
mortality, largely due to self-mutilation and repetitive 
fractures (1). Pain, of course, is also a 
source of substantial discomfort, and humans 
have long sought ways to ameliorate pain, as 
exemplified by the Ebers Papyrus from ancient 
Egypt. As a concept, pain has evolved from ar- 
chaic notions of demonic punishment to more 
contemporary views of biological circuit-based 
origins for the sensation (2). The most recent 
definition of pain proposed by the International 
Association for the Study of Pain, unchanged 
since its first publication in 1979, is “an unplea- 
sant sensory and emotional experience associ- 
ated with actual or potential tissue damage, or 
described in terms of such damage.” The full 
biological complexity of pain and its underlying 
circuitry is not wholly conveyed by this defini- 
tion, however. Pain stems from a varied array of 
peripheral sensors that detect nociceptive stim- 
uli within internal tissues and those produced 
by the external world. The information is sub- 
sequently distributed to a series of complex 
neural circuits in the spinal cord dorsal horn 
and then to numerous brain regions, producing 
a diverse set of emotions, actions, and sensa- 
tions (Fig. 1). Pain also manifests with different 
qualities, including stabbing, pricking, burning, 
or aching, further highlighting the heterogene- 
ity of the underlying neural circuitry. Longer- 
lasting pain states produced by nerve and tissue 
damage, which are also heterogeneous in na- 
ture, provide ongoing awareness of the injured 
area. Pain can become chronic and debilitating 
when the tissue damage persists and, in some 
conditions, even after the wound has healed. 
Because of the high prevalence and lack of ade- 
quate treatment options, unraveling the bi- 
ological basis of persistent pain continues to 
be an area of intense study. The objective of 
this Review is to provide an overview of pain 
circuitry, with an emphasis on recent impor- 
tant advances in our understanding of path- 
ological pain. Insights into neural circuits for 
pain that have therapeutic potential are also 
discussed. 


How is pain detected in the periphery? 


Pain is produced through the activation of no- 
ciceptive primary sensory neurons categorized 
into the classes C, Aδ, and to a lesser extent Aβ, 
depending on their axon caliber, degree of mye- 
lination, and conductivity properties. Nociceptors 
innervate the skin, deep tissues, and internal 
organs and are tuned to detect a wide variety of 
noxious mechanical, thermal, and chemical stim- 
uli through the activation of modality-specific 
sensory transduction molecules. A number of 
these key molecules have been identified (3). 
Most if not all modalities are conveyed through 
the activation of more than one transducer, as 
exemplified by heat transduction. Genetic de- 
letion of the heat-sensing transient receptor 
potential channel vanilloid 1 (TRPV1) in mice 
impaired the response to heat only at high nox- 
ious temperatures (4), pointing to the existence 
of one or more additional heat sensors. Recent 
efforts have now identified TRPM3 (5) and the 
calcium-activated chloride channel ANO1 (6) as 
important contributors to the sensation of heat. 
As was observed with TRPV1, deletion of either 
channel alone strongly reduced, but did not 
completely eliminate, noxious heat sensitivity. 
In contrast, ablation of TRPV1⁺ fibers by intrathecal 
injection of the TRPV1 ligand capsaicin 
(7) or silencing the cells by selective uptake 
of the voltage-gated sodium channel blocker 


QX-314 (8) completely abolished noxious heat 
sensitivity, demonstrating that the fibers are 
sufficient to account for heat pain, despite a 
more limited contribution from each channel. 
Given that TRPV1, TRPM3, and ANO1 show 
considerable overlap in their distribution, analy- 
ses of double and triple genetic deletion of the 
channels in mice may be required to fully un- 
derstand at the cellular level how noxious heat 
is transduced. Cold is also transduced through 
multiple channels. Mice with a genetic deletion 
of the menthol-sensitive TRPM8, a critical transducer 
of innocuous cooling, showed only partial 
avoidance of noxious cold temperatures, 
whereas selective ablation of TRPM8-expressing 
cells completely abolished noxious cold sensi- 
tivity (9). The molecules that transduce me- 
chanical nociception remain stubbornly elusive. 
Deletion of the leading candidate molecules in 
mice unexpectedly produced little to no change 
in noxious mechanical pain sensitivity (10, 11). 


Transduction in the periphery: 
More than just neurons 


Epithelial cells such as keratinocytes and Merkel 
cells directly interact with peripheral axon ter- 
minals and have been implicated in the mod- 
ulation of sensory transduction. Recently, several 
elegant studies have demonstrated an active 
role for epithelial cells in tuning the response of 
sensory neurons. Merkel cells are the epider- 
mal end organ of slowly adapting type 1 (SA1) 
mechanoreceptors and are involved in the 
detection of features such as edges and textures. 
Depolarization of the Merkel cells by optoge- 
netic stimulation or by touch, which depends 
on the innocuous mechanotransduction chan- 
nel Piezo2, directly excites SA1 fibers through 
still-unidentified neurotransmitter signaling 
mechanisms (12). Epithelial cells have also been 
shown to actively contribute to nociceptive trans- 
mission. Activation of channelrhodopsin or 
TRPV1 channels ectopically expressed by keratinocytes 
in mice induced action potential firing 
in sensory neurons, neuronal activity in the 
spinal cord, and nocifensive behaviors (13, 14). 
Conversely, inhibition of the cells blocked the re- 
sponse of nociceptors and other primary sensory 
neurons to cutaneous stimulation (14). Studies 
that identify the salient signaling molecules or 
establish the extent to which epithelial cells con- 
tribute behaviorally to pain will be of interest. 


Is pain a labeled line? 


Most nociceptors express various combinations 
of sensory transducers that are specific for dif- 
ferent modalities (often heat and mechanical) 
and therefore are classified as “polymodal.” 
Large-scale efforts using single-cell RNA tran- 
scriptome analyses have sought to better un- 
derstand the molecular and functional logic of 
primary sensory neurons by parsing the tran- 
scriptome profiles of hundreds of individual 
cells into groups defined by convergent gene ex- 
pression patterns (15-17). These cluster analyses 
offer a richer picture of the diversity of nociceptors 
than previous molecular marker-based 
classifications, as well as a deeper understanding 
of the molecular constituents that give rise 
to the functional properties of sensory neurons 
(Fig. 2). Although the precise relationships 
between the molecular constituents and the 
response properties of the neurons will require 
further study, the transcriptome data do show 
some predictive capacity: For example, the tyrosine 
hydroxylase (TH+) cluster expresses high 
levels of Piezo2, but not the heat sensors TRPV1, 
TRPM3, or ANO1, consistent with the response 
of the cells to light touch, but not to heat (18-20). 
However, limits on the ability to predict the 
functional output of nociceptors on the basis of 
their molecular and biophysical properties were 
also recently demonstrated in studies in which se- 
lect polymodal populations were ablated in mice. 
Ablation of nociceptors that express the Mas- 
related G protein-coupled receptor subtype D 
(MRGPRD) markedly reduced acute and persist- 
ent mechanical pain but had no effect on thermal 
sensitivity (7). Similarly, ablation of the calcitonin 
gene-related peptide (CGRP) neurons, a population 
that partially overlaps with the heat sensors 
TRPV1, TRPM3, and to a lesser extent ANO1, 
predictably produced a profound loss of heat 
sensitivity but did not alter innocuous or noxious 
mechanical sensation (21). Loss of CGRP-expressing sensory 
neurons, which are not responsive to cooling 
agents (22), greatly increased tonic and evoked 
activity in spinal neurons associated with cold. 
This latter finding, mechanistically resembling 
the activation of dorsal horn itch circuits after 
silencing pain afferents (pain normally inhibits 
itch at the level of the spinal cord), further 
highlights the complexity of primary afferent 
coding, as well as an important role for central 
processing in shaping the sensory percept. 


Fig. 1. Overall organization of somatosensory circuits. Cutaneous sensory 
neurons (DRG) are activated by a variety of stimuli (bottom left) and 
project to the spinal cord dorsal horn (DH, middle left). In the DH (right), 
the central terminals of high-threshold nociceptors (HT) are located in 
the most superficial laminae [lamina I to the dorsal part of inner lamina II 
(IIid)] and lamina V. Low-threshold mechanoreceptors (LTMR) preferentially 
end in the deep dorsal horn [ventral part of inner lamina II (IIiv) to 
lamina V]. The spinal cord is divided into 10 laminae (the DH is I to VI) and is 
composed of numerous neuronal populations. Some identified populations 
are organized in longitudinal layers (only excitatory neurons are 
represented): neurons transiently expressing VGLUT3 (tVGLUT3, orange) in 
laminae III and IV, PKCγ (green) in lamina IIi, calretinin (blue) in outer lamina II 
(IIo) and lamina IIid, vertical cells (yellow) in lamina IIo, and projection 
neurons (red) in laminae I, IV, and V. Projection neurons send information 
to the brainstem and thalamus and then on to several brain regions 
implicated in sensory-discriminative (upper left, light brown) and emotional 
(upper left, dark brown) sensory perception. ACC, anterior cingulate cortex; 
SI (SII), primary (secondary) somatosensory cortex; PAG, periaqueductal gray 
area; PB, parabrachial nucleus; AMY, amygdala; PFC, prefrontal cortex; BG, 
basal ganglia. 


Integration in the spinal cord 


Primary sensory neurons innervate the six Rexed 
laminae of the spinal cord dorsal horn, the major 
locus for the integration of peripheral sensory 
input and descending supraspinal modulation. 
The central terminals of sensory neurons are 
somatotopically organized, forming ventrodorsal- 
oriented columns similar to the arrangement of 
the primary somatosensory cortex. Most C- and 
Aδ-nociceptive afferents form synaptic contacts 
in the superficial laminae (I and II), whereas 
low-threshold Aδ- and Aβ-afferents generally 
project to deeper laminae (III to V) (Fig. 1). 
Accordingly, in vivo electrophysiological record- 
ings in mammals have shown that laminae I 
and II respond mainly to noxious peripheral 
stimulation, whereas neurons in deep layers are 
more sensitive to touch. The dorsal horn is com- 
posed of a large number of excitatory (75%) 
and inhibitory (25%) interneuron populations, 
as well as a smaller number of projection pop- 
ulations located in laminae I, III, IV, and V. The 
projection neurons relay information to numer- 
ous supraspinal sites to give rise to both qual- 
itative and affective aspects of the pain sensation. 


Recent studies of dorsal horn nociceptive circuits 
have focused largely on persistent pain because 
of its high prevalence and the need for better 
treatment options (23, 24). 


Mechanical allodynia: 
Peripheral contributions 


Nerve and tissue damage produces dramatic 
changes in peripheral and central somatosen- 
sory circuits. Although acute pain typically 
follows the activation of nociceptors, light touch- 
activated neurons can be recruited into the 
nociceptive network to cause pain after injury. 
This pathological condition, termed mechan- 
ical allodynia, occurs in numerous peripheral 
neuropathies and central pain disorders, pre- 
senting in up to 50% of patients with neuro- 
pathic pain (25). Still unknown is the identity 
of the light touch-sensitive primary sensory 
population(s) that conveys mechanical allody- 
nia. A large number of primary afferents re- 
spond to light mechanical touch and are thus 
potential candidates (26). 

Although the identity of the mechanosensory 
neurons engaged during mechanical allodynia 
remains unclear, progress has been made in our 
understanding of innocuous sensory mechanotransduction 
(27). As mentioned previously, innocuous 
tactile sensitivity is primarily dependent 
on the cation channel Piezo2 (28). In primary 
sensory neurons, the channel is almost exclu- 
sively expressed by low-threshold mechanore- 
ceptors. Conditional deletion of Piezo2 in all 
sensory neurons in mice resulted in markedly 
reduced mechanical activation of low-threshold 
mechanoreceptors, as well as reduced behav- 
ioral touch sensation, with no change in acute 
mechanical pain (29). Because low-threshold 
mechanoreceptors are thought to convey some 
forms of mechanical allodynia, Piezo2 could have 
a role in this type of pain—although after an 
inflammatory insult, mice with a conditional 
deletion of Piezo2 in primary sensory neurons 
did not show evidence of a defect in their me- 
chanical pain threshold under the conditions 
tested. In contrast, a separate study, which dem- 
onstrated sensitization of Piezo2 by the cyclic 
adenosine monophosphate sensor EPAC1 after 
neuropathic injury, pointed to a potential role 
for the mechanotransducer in mechanical allodynia 
(30). 


Mechanical allodynia: The silent dorsal 
horn circuit 


Fig. 2. Organization of primary sensory neurons. (A) The categories of 
myelinated and unmyelinated neurons and their respective functional roles 
are derived from large-scale transcriptional analyses, behavioral analyses, and 
the literature. (B) The locations of the central terminations of the primary 
sensory neuron categories are shown. The schematic is based on analyses of 
gene reporter mouse lines (PV, RET, TRKB, TH, MRGPRD, MRGPRA3, CGRP, 
and TRPM8) and immunohistological analyses (TRKA, TRPV1, TAC1). Myelinated 
neurons preferentially express neurofilament heavy chain (NEFH), and 
unmyelinated neurons preferentially express the sodium channels Nav1.8 
and -1.9. Laminae are indicated on the left. RET, ret proto-oncogene; 
CALB1, calbindin 1; TAC1, tachykinin 1; SST, somatostatin; NPPB, natriuretic 
peptide B. 


Changes in the dorsal horn circuitry contribute 
to the expression of mechanical allodynia. One 
of the principal concepts to explain these changes 
is based on the gate control theory in which 
touch inhibits pain through a feed-forward 
inhibitory circuit within the superficial layers of 
the dorsal horn (31). In the case of mechanical 
allodynia, injury impairs the feed-forward 
inhibitory circuit, allowing light touch to engage 
the nociceptive network. A seminal study, 
performed in rodent spinal cord slices, showed 
that low-threshold mechanosensory A-fiber 
input is able to activate lamina I pain projection 
neurons through a polysynaptic network when 
inhibitory receptor antagonists are present 
(mimicking the injury-induced decrease in 
inhibition) (32). Importantly, the study established 
that the mechanical allodynia circuit, which is 
normally silent, is already in place under 
physiological conditions. 
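
The gating logic described above can be illustrated with a deliberately simple rate model. This is a sketch under stated assumptions, not a model taken from the cited studies: the function names, the threshold-linear units, and all numerical values (drive, gain, threshold) are hypothetical and chosen only to show how weakening a feed-forward inhibitory gate lets the same touch input reach a projection neuron.

```python
def rectify(x):
    """Threshold-linear activation: firing rates cannot be negative."""
    return max(0.0, x)


def projection_output(touch_drive, inhibitory_gain, threshold=0.5):
    """Toy feed-forward gate (all parameters are illustrative assumptions).

    touch_drive     -- low-threshold A-fiber input, arbitrary units
    inhibitory_gain -- strength of the inhibitory interneuron; lowering it
                       mimics injury or inhibitory receptor antagonists
    threshold       -- drive the projection neuron needs before it fires
    """
    # The touch input excites an inhibitory interneuron ...
    inhibition = inhibitory_gain * rectify(touch_drive)
    # ... and an excitatory relay that the inhibitory cell suppresses.
    relay = rectify(touch_drive - inhibition)
    # The projection neuron fires only if the relay exceeds its threshold.
    return rectify(relay - threshold)


intact = projection_output(touch_drive=1.0, inhibitory_gain=1.0)
disinhibited = projection_output(touch_drive=1.0, inhibitory_gain=0.2)
print(f"intact gate: {intact:.2f}, reduced inhibition: {disinhibited:.2f}")
```

In this toy circuit the excitatory path exists in both conditions; only the strength of inhibition differs, which mirrors the idea that the allodynia circuit is present but silent under physiological conditions.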


Dorsal horn mechanical allodynia 
circuits: Excitatory neurons 


The dorsal horn circuit for mechanical allodynia 
was originally modeled as a dorsally directed 
pathway that begins with excitatory interneu- 
rons at the border between laminae II and III 
that express the γ isoform of protein kinase C 
(PKCγ) and ends with lamina I pain projection 
neurons (33) (Fig. 3). Two intermediary pop- 
ulations in this pathway were recently identi- 
fied through paired recordings, the transient 
central cells in inner lamina II and vertical cells 
in outer lamina II (34). The dendrites of vertical 
cells extend deep into lamina III and receive 
input from most classes of primary afferents, 
suggesting a potential role as integrators of 
the network (35). An expansion of the circuit 
into deeper laminae was shown by studies of the 
vesicular glutamate transporter 3 (VGLUT3). A 
population of excitatory interneurons was iden- 
tified in lamina III that transiently expresses 
VGLUT3 and receives almost exclusively low- 
threshold input. Selective activation of the cells 
by using a designer receptor strategy (DREADD) 
further demonstrated their important role in the 
transmission of touch as painful (36). Calretinin- 
expressing excitatory interneurons in inner 
lamina II were also identified as an essential 
element of the circuit. A functional study of the 
calretinin and PKCγ populations that used c-Fos 
analysis revealed the existence of distinct ex- 
citatory microcircuits for mechanical allodynia, 
which are differentially engaged depending on 
whether the injury is inflammatory or neuro- 
pathic in nature (36). 


Mechanical allodynia circuits: 
The “gates” 


Inhibitory neurons are a fundamental element 
of the mechanical allodynia network in the 
spinal cord, forming “gates” that prevent the re- 
cruitment of low-threshold fibers into the no- 
ciceptive network under physiological conditions 
(Fig. 3). Using paired recordings in spinal cord 
slices, a recent study showed that the PKCγ 
interneurons, which receive Aβ-fiber input, are 
normally under feed-forward glycinergic disynaptic 
inhibition (34). After neuropathic injury, 
which is known to impair inhibitory transmission 
in the dorsal horn, Aβ-fibers were able to 
drive action potentials in PKCγ neurons and 
presumably the rest of the nociceptive network. 
Consistent with this finding, selective ablation 
of lamina III parvalbumin (PV)-expressing 
interneurons, which form contacts with the PKCγ 
neurons, induced mechanical allodynia, indicating 
a role for PV interneurons in gating the 
allodynic pathway (37). Conversely, selective 
activation of the PV interneurons by the 
excitatory DREADD strategy reversed mechanical 
allodynia induced by nerve injury. In addition 
to PKCγ neurons, PV interneurons were also 
proposed to act directly on the Aβ-afferents 
innervating these neurons, suggesting alternative 
mechanisms for inhibitory control of the circuit 
(38). The involvement of a second population 
of inhibitory interneurons in the feed-forward 
gating of mechanical allodynia was also 
recently demonstrated. Ablation of dynorphin- 
expressing interneurons in outer lamina II and 
at the border of laminae II and III induced 
mechanical allodynia. The dynorphin gate was 
further demonstrated to control somatostatin- 
expressing interneurons, a major population of 
excitatory neurons that partially overlaps with 
PKCγ neurons and potentially also vertical and 
transient central cells (39). Studies of the 
mechanical allodynia network are beginning to not 
only reveal individual elements but also produce 
conceptual advances, including the understanding 
that there are distinct excitatory microcircuits 
and multiple distinct gates related to 
specific types of injuries. 


Fig. 3. Organization of the dorsal horn circuit for pain. Peripheral nociceptors (blue) project 
onto excitatory interneurons in lamina IIo (central cell, dark gray; vertical cells, yellow) and onto 
neurokinin 1 receptor (NK1R) projection neurons (red) in lamina I. Nonpeptidergic afferents 
expressing MRGPRD (green) project to lamina IIid, including to excitatory vertical cells with ventrally 
directed elongated dendrites. Both primary afferents also contact inhibitory islet cells (horizontally 
elongated, pink). Stimulation of nociceptive afferents activates excitatory central cells, vertical 
cells, and NK1R projection neurons to mediate noxious pain. Inhibitory islet cells modulate this 
activity. Innocuous afferents (orange) project onto excitatory interneurons expressing tVGLUT3 in 
lamina III (brown), PKCγ in lamina IIiv (teal), and vertical cells in lamina IIo. Myelinated afferents also 
contact PV inhibitory interneurons in lamina III (radial, pink) and dynorphin inhibitory vertical cells 
in lamina IIo (vertical, pink). tVGLUT3 interneurons project onto excitatory vertical cell dendrites 
and intermediate excitatory interneurons in lamina III. Intermediate interneurons project onto PKCγ 
and calretinin excitatory interneurons in lamina IIi. Inhibitory interneurons prevent A-fiber-mediated 
activation of the nociceptive network through feed-forward circuits that act on PKCγ interneurons, 
vertical cells, and NK1R projection neurons. After nerve injury, inhibition by PV and dynorphin 
interneurons is reduced, allowing A-fiber-mediated activation of a dorsally directed circuit that 
includes tVGLUT3 and PKCγ neurons. After inflammatory injury, a still-unknown mechanism 
of disinhibition allows A-fiber-mediated activation of a dorsally directed circuit that includes tVGLUT3 
and lamina IIi calretinin neurons. 


Mechanisms of disinhibition underlying 
mechanical allodynia 


The central sensitization responsible for me- 
chanical allodynia produces various forms of 
disinhibition in the dorsal horn (40). One mech- 
anism involves the injury-induced release of 
brain-derived neurotrophic factor (BDNF), which 
triggers down-regulation of the potassium chloride 
cotransporter 2 (KCC2) in tropomyosin receptor 
kinase B (TRKB)-expressing pain-transmitting 
neurons, reducing the driving force for chloride 
and thus the strength of inhibitory transmis- 
sion (41, 42). The mechanism of injury-induced 
BDNF release was recently shown to differ be- 
tween males and females. Males require the 
activation of microglia, whereas females require 
adaptive immune cells, possibly T lymphocytes 
(43). Sex differences are frequently observed in 
animal models of chronic pain and may provide 
insight into the many clinically observed sex 
differences in pain in humans (44). 
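
The link between KCC2 down-regulation and weaker inhibition can be made concrete with the Nernst equation for chloride. The sketch below is illustrative only: the intracellular and extracellular chloride concentrations and the membrane potential are assumed round numbers, not measurements from the cited studies, and the function names are hypothetical. It shows that when less chloride is extruded and intracellular chloride rises, the chloride reversal potential moves toward the membrane potential and the driving force for inhibitory current shrinks.

```python
import math

GAS_CONSTANT = 8.314462618   # J / (mol K)
FARADAY = 96485.33212        # C / mol
BODY_TEMPERATURE_K = 310.15  # 37 degrees C


def chloride_reversal_mv(cl_inside_mm, cl_outside_mm,
                         temperature_k=BODY_TEMPERATURE_K):
    """Nernst potential for chloride (valence -1), in millivolts."""
    valence = -1
    volts = (GAS_CONSTANT * temperature_k) / (valence * FARADAY) \
        * math.log(cl_outside_mm / cl_inside_mm)
    return 1000.0 * volts


def driving_force_mv(membrane_mv, reversal_mv):
    """Distance between membrane potential and chloride reversal potential."""
    return membrane_mv - reversal_mv


# Assumed, illustrative values (not taken from the article).
CL_OUTSIDE_MM = 120.0
MEMBRANE_MV = -60.0

e_cl_normal = chloride_reversal_mv(cl_inside_mm=5.0, cl_outside_mm=CL_OUTSIDE_MM)
e_cl_low_kcc2 = chloride_reversal_mv(cl_inside_mm=10.0, cl_outside_mm=CL_OUTSIDE_MM)

force_normal = driving_force_mv(MEMBRANE_MV, e_cl_normal)
force_low_kcc2 = driving_force_mv(MEMBRANE_MV, e_cl_low_kcc2)

print(f"E_Cl normal: {e_cl_normal:.1f} mV, driving force: {force_normal:.1f} mV")
print(f"E_Cl reduced KCC2: {e_cl_low_kcc2:.1f} mV, driving force: {force_low_kcc2:.1f} mV")
```

With these assumed numbers, doubling intracellular chloride shifts the reversal potential by roughly 18 to 19 mV in the depolarizing direction, so the same inhibitory conductance produces a much smaller hyperpolarizing current.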
Investigations into the mechanisms under- 
lying BDNF release in the dorsal horn of male 
rodents after injury have focused on adenosine 
triphosphate signaling through purinergic P2X4 
receptors, which requires microglia activation, 
and on the involvement of chemokine signaling 
cascades that promote and maintain microglia 
activation (45). De novo production and release 
of colony-stimulating factor 1 (CSF1) by injured 
primary sensory neurons is required for the 
initial activation of the microglia, as well as for 
cell proliferation and self-renewal. Deletion of 
CSF1 specifically from primary sensory neurons 
or deletion of its downstream effector, the mi- 
croglial membrane adaptor protein DAP12, 
was sufficient to markedly reduce microglial 
activation, as well as the behavioral expres- 
sion of mechanical allodynia induced by nerve 
injury (46). 


Pain processing in the brain 


Functional magnetic resonance imaging (fMRI) 
studies in humans have demonstrated the co- 
ordinated activation of several brain areas in 
response to noxious somatic and visceral stim- 
uli, including the thalamus, anterior cingulate 
cortex (ACC), insular cortex, primary and sec- 
ondary sensory cortices, prefrontal cortex, basal 
ganglia, cerebellum, and amygdala. This net- 
work of brain regions involved in both sensory- 
discriminative and emotional-affective aspects 
of pain is termed the “pain matrix” (47). 
Several recent studies have questioned whether 
activation of the pain matrix specifically repre- 
sents pain or is a more generalized system to 
detect salient events and the state of the body. 
The posterior insular cortex encodes nociceptive 
intensity, and lesions in this brain area disrupt 
pain, whereas direct stimulation of the area in- 
duces pain. However, the insular cortex also has 
a role in affective, cognitive, and homeostatic 
functions for nonnociceptive sensory modalities. 
Intracerebral recordings of the anterior and 
posterior insular cortex showed activity levels 
that responded similarly to different sensory 
modalities (visual, auditory, and somatosensory), 
as well as to noxious and innocuous somato- 
sensory stimuli, consistent with the idea that 
the insula is not specific for pain-related infor- 
mation (2). Additionally, patients with loss-of- 
function mutations in the sodium channel Nav1.7 
(SCN9A) show congenital insensitivity to pain due 
to a loss of noxious peripheral sensory drive (1). 
In a recent study, the response of patients and 
controls to normally noxious peripheral stimuli 
was assessed by fMRI. Both groups reported 
experiencing similar levels of sensation; al- 
though controls reported the stimuli as painful 
and patients reported no pain, the two groups 
nevertheless showed similar patterns and levels 
of activity within the pain matrix (48). Gross 
activity measured in supraspinal structures thus 
may not reliably predict the pain experience. 
Rather, pain-related information is likely encoded 
by specific subregional patterns of activity 
throughout the pain matrix, which would require 
analysis by other methods and by experimental 
designs that allow for causal inference. 
Machine-learning algorithms to assess fMRI 
patterns of activity across the pain matrix are 
showing a high predictive capacity for distinct 
forms of physical pain, as well as for social pain 
such as vicarious pain (49). 
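
The distinction between gross activity and pattern-based decoding can be sketched with a toy classifier. This is not the method of the cited study: the four-element "activity patterns" are made-up numbers, the labels are hypothetical, and a nearest-centroid rule stands in for whatever machine-learning algorithm is actually used. The example only shows that two conditions with the same average activity can still be separated by their spatial pattern.

```python
from statistics import mean


def centroid(patterns):
    """Average pattern across examples, computed element by element."""
    return [mean(column) for column in zip(*patterns)]


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def classify(pattern, centroids):
    """Return the label whose centroid is closest to the pattern."""
    return min(centroids, key=lambda label: squared_distance(pattern, centroids[label]))


# Synthetic activity across four hypothetical subregions (illustrative only).
training = {
    "pain": [[0.9, 0.1, 0.8, 0.2], [1.0, 0.0, 0.9, 0.1]],
    "no_pain": [[0.1, 0.9, 0.2, 0.8], [0.0, 1.0, 0.1, 0.9]],
}

# Gross activity: one number per condition, averaged over everything.
gross_activity = {
    label: mean([value for pattern in examples for value in pattern])
    for label, examples in training.items()
}

# Pattern decoding: compare a new observation with each condition's centroid.
centroids = {label: centroid(examples) for label, examples in training.items()}
new_observation = [0.8, 0.2, 0.9, 0.1]
predicted = classify(new_observation, centroids)

print("gross activity per condition:", gross_activity)
print("predicted label from pattern:", predicted)
```

Here the averaged activity is essentially identical for the two conditions, whereas the pattern-based rule assigns the new observation to one of them without ambiguity.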

Complementing these recent findings from 
brain-imaging studies in humans, work in ro- 
dents has focused on synaptic-level mechanisms 
of pain processing and plasticity in particular 
regions of the pain matrix. These studies pro- 
vide further support for the idea that individ- 
ual brain areas have multiple functional roles. 
Emotional aspects of pain such as anxiety and 
fear have been linked to the ACC. A study in 
mice identified a form of presynaptic long- 
term potentiation (pre-LTP) at thalamocorti- 
cal synapses in the ACC that was induced by 
either anxiety-evoking experiences or chronic 
pain models (50). It was concluded that the 
pre-LTP represents anxiety, which can be trig- 
gered by chronic pain. The coexistence of this 
form of pre-LTP with a previously identified 
form of postsynaptic LTP required for the behavioral 
expression of chronic pain (51) suggests 
an important role for these ACC synapses in the 
mutually enhancing interactions of anxiety and 
chronic pain. 

Mechanisms of plasticity affecting motiva- 
tion in models of persistent pain were also 
investigated recently. The nucleus accumbens 
has been implicated in subjective pain pro- 
cessing in humans and in motivation in ro- 
dents. The induction of persistent pain by 
either nerve injury or an inflammatory medi- 
ator impaired motivational behavior in mice, 
as measured using a progressive ratio oper- 
ant task (52). The decrease in motivation was 
linked to enhanced release of the neuropep- 
tide galanin in the nucleus accumbens core, 
which induced a form of long-term depression 
(LTD) in dopamine receptor 2-expressing me- 
dium spiny neurons (D2-MSNs), thereby reduc- 
ing their activity. The injury-induced effects on 
D2-MSNs and motivated behavior were reversed 
by knockdown of the galanin receptor GALR1 
and by interfering with intracellular signaling 
pathways required for the LTD. Synaptic-level 
investigations of pain-related mechanisms are 
thus providing important insights into the role 
of individual brain areas in the pain experience, 
including interactions with other brain func- 
tions. Such studies are also revealing potential 
new therapeutic avenues. 


Blocking pain circuits at the periphery 


Nerve growth factor (NGF) and its major recep- 
tor TRKA have essential roles in nociceptor func- 
tion both during development and in adults. 
Specific mutations in the genes encoding NGF 
or TRKA in humans produce phenotypes that 
include insensitivity to pain, largely resulting 
from defects in the development of nocicep- 
tors (53). In adults, NGF signaling through 
TRKA receptors at the periphery contributes 
considerably to the pain associated with many 
types of injuries and diseases, including bone 
cancer, lower back injury, diabetic neuropathy, 
and osteoarthritis. As a new class of analgesic 
treatment, humanized antibodies raised against 
NGF, such as tanezumab (Pfizer), now in phase 
III clinical trials for osteoarthritis pain, are showing 
promise. Inhibitors of the TRKA receptor are also 
of therapeutic interest. 

Recent studies have focused also on the ther- 
apeutic potential of targeting voltage-gated so- 
dium channels in sensory neurons to relieve 
pain. In addition to the Nav1.7 loss-of-function 
mutations that cause congenital insensitivity 
to pain in humans, studies of Nav1.7 and Nav1.8 
in mice indicate a required role for the channels 
in many forms of acute and pathological 
pain (54, 55). A highly specific Nav1.7 inhibitor 
isolated from centipede venom, μ-SLPTX-Ssm6a, 
when injected intraperitoneally in mice, showed 
potent analgesic effects on chemical-induced 
pain with an efficacy equivalent to or exceeding 
that of morphine (56). Similarly, monoclonal 
antibodies against Nav1.7, when injected 
into mice, effectively reduced mechanical al- 
lodynia and thermal hyperalgesia induced by 
inflammation and neuropathic injury (57). Given 
the promising therapeutic benefit of targeting 
Nav1.7 channels to relieve pain, several selective 
inhibitors are now in phase II clinical trials. 

Animal venoms that elicit pain have been 
particularly useful for identifying and charac- 
terizing therapeutically relevant nociceptive 
sensory molecules. MitTx, from the venom of 
the Texas coral snake Micrurus tener tener, is 
a highly specific and strong potentiator of the 
acid-sensing channel ASIC1. Injection of this 
peptide into the hind paw of mice induced pro- 
nounced nocifensive behaviors (58). Conversely, 
central and peripheral delivery of the isopep- 
tides mambalgin 1 and 2, from the black mam- 
ba snake, reversed mechanical allodynia and 
heat hyperalgesia induced by inflammatory 
agents by inhibiting ASIC channels, highlight- 
ing the therapeutic potential of this family of 
ion channels (59). 

Recent advances in the in vivo application 
of optogenetic methods also provide attractive 
therapeutic opportunities for pain. One of the 
first studies to report the use of optogenetics 
to evaluate pain circuits in vivo targeted chan- 
nelrhodopsin to Nav1.8-expressing primary sen- 
sory neurons in mice. Cutaneous light-mediated 
activation of the neurons elicited robust no- 
cifensive behaviors and conditioned place aver- 
sion (60). In testing of opsins for therapeutic 
purposes, the inhibitory opsin NpHR was tar- 
geted to small-diameter primary sensory neu- 
rons by injection of AAV6-NpHR into the sciatic 
nerve. Cutaneous light-mediated inhibition of 
the infected primary sensory neurons alleviated 
both inflammatory and neuropathic pain (61). 
The low penetration of light through the skin 
may be technically limiting; however, implant- 
able miniaturized optoelectronic systems for 
wireless control of primary sensory and spinal 
cord neurons are now being developed (62). 

Another innovative approach to inhibit nociceptors 
targets the voltage-gated sodium channel 
blocker QX-314 to select populations, thereby 
blocking nerve conduction. Coapplication of capsaicin, 
which permits entry of QX-314 through 
the cation channel TRPV1, to the mouse plantar 
hind paw markedly reduced the response to heat 
and mechanical pressure (8). QX-314 has also 
been used to ameliorate persistent pain. The 
Toll-like receptor TLR5 was identified as a re- 
ceptor specific to A-fibers (63). Coapplication 
of the TLR5 ligand flagellin and QX-314 to 
the mouse plantar hind paw blocked A-fiber 
activity, as measured electrophysiologically in 
vitro and in vivo. Although the precise mecha- 
nism by which QX-314 enters the cells is still 
unclear, the approach successfully reversed me- 
chanical allodynia and ongoing pain in a num- 
ber of chronic neuropathic pain models in mice 
without producing obvious deficits in touch 
sensation. 

Lastly, the use of gene therapy to restore the 
normal expression of dysregulated proteins in 
sensory neurons has become another attract- 
ive strategy to treat pain. The expression of 
the microRNA miR-7a is dramatically reduced 
after neuropathic injuries. Increasing its expression 
by intraganglionic injection of AAV6- 
miR-7a abolished mechanical allodynia and 
thermal hyperalgesia induced by nerve injury, 
although not by inflammation (64). In contrast, 
nerve injury substantially up-regulates the sodium 
channel Nav1.3. Intraganglionic injection 
of AAV5 expressing a small hairpin RNA 
against Nav1.3 reduced its expression and partially 
reversed nerve injury-induced mechanical 
allodynia (65). 


Blocking pain circuits centrally 


Because mechanisms of disinhibition in the 
dorsal horn have a critical role in the expression 
of mechanical allodynia and spontaneous pain, 
therapeutic efforts have been directed toward 
the restoration of inhibitory tone. Administra- 
tion of a small-molecule enhancer of KCC2 
activity (CLP290) to rodents restored inhibitory 
drive in dorsal horn neurons, as measured elec- 
trophysiologically in spinal cord slices, and 
reversed behavioral evidence of mechanical 
allodynia induced by nerve injury. Similarly, 
overexpression of the cotransporter KCC2 in 
the spinal dorsal horn and primary sensory 
neurons of rodents reversed behavioral evi- 
dence of mechanical allodynia induced by 


peripheral nerve injury (66). Humans with a 
mutation in the HSN2-containing splice vari- 
ant of the kinase WNK1 suffer from congenital 
insensitivity to pain. WNK1/HSN2 plays a role 
in nerve injury-induced mechanical allodynia 
by reducing KCC2 activity in lamina II neu- 
rons of the dorsal horn (67). Antagonizing WNK 
activity in rodents restored GABA (γ-aminobutyric 
acid)-mediated inhibition of spinal cord lam- 
ina II neurons and reversed the behavioral 
evidence of mechanical allodynia induced by 
peripheral nerve injury, but not by inflamma- 
tory injury. 

Using an entirely different strategy, based on 
the observation that GABAergic precursors from 
the medial ganglionic eminence grafted into the 
forebrain correct neurological disorders of hy- 
perexcitability, such as epilepsy, Braz et al. tested 
whether injecting the telencephalic GABAergic 
precursors directly into the dorsal horn of 
nerve-injured mice could restore inhibitory tone 
and reverse mechanical allodynia (68). Although 
the precursor cells maintained their cortical 
identity, they integrated successfully into the 
dorsal horn, forming local synaptic connections 
with primary sensory and spinal cord neurons. 
By 3 weeks after transplantation, mechanical 
thresholds were elevated to pre-nerve injury 
levels. In contrast, inflammation-induced hyper- 
sensitivity was not altered by cell transplanta- 
tion. This work demonstrates that strategies to 
reverse or compensate for injury-induced dis- 
inhibition in the dorsal horn show promise for 
the treatment of pain. 


Conclusions and the road ahead 


Recent advances at the molecular, cellular, and 
systems level are transforming our understand- 
ing of how the complex sensation of pain is 
encoded in the nervous system and providing 
exciting new avenues for therapeutic inter- 
vention. Several key aspects of the nociceptive 
network and the sensation of pain, however, 
remain to be addressed. With respect to phys- 
iological sensory processing, major outstand- 
ing questions include: What is the identity of 
the noxious mechanotransducer(s)? What are 
the specific contributions of molecularly dis- 
tinct primary sensory neurons to modality 
coding? What is the logic of nociceptor coding 
of pain? How do spinal cord circuits contrib- 
ute to modality coding? What is the logic of 
the projection neuron populations? Which neu- 
ronal populations in the brain process the dif- 
ferent aspects of pain? In terms of pathological 
pain, major outstanding questions include: What 
are the neural networks for spontaneous pain? 
What are the peripheral and central components 
of the neural networks that underlie mechanical 
allodynia induced by different types of injuries, 
such as inflammatory and neuropathic? How do 
the neural circuits for pain differ as a function of 
chronicity? How does chronic pain manifest in 
the brain? Understanding the precise contribu- 
tions of the neurons and molecules that underlie 
each form of pain will greatly facilitate therapeutic 
drug development. 
REVIEW


Deconstructing the sensation of pain:
The influence of cognitive processes
on pain perception


Katja Wiech*


Phenomena such as placebo analgesia or pain relief through distraction highlight the
powerful influence cognitive processes and learning mechanisms have on the way
we perceive pain. Although contemporary models of pain acknowledge that pain is not
a direct readout of nociceptive input, the neuronal processes underlying cognitive
modulation are not yet fully understood. Modern concepts of perception—which include
computational modeling to quantify the influence of cognitive processes—suggest that
perception is critically determined by expectations and their modification through learning.
Research on pain has just begun to embrace this view. Insights into these processes
promise to open up new avenues to pain prevention and treatment by harnessing the
power of the mind.


Watching a captivating film while a dentist
is fixing your tooth can help you
endure the much-dreaded visit with surprisingly
little pain. Cognitive processes,
such as distraction, have the potential to
change the way we perceive pain—for better or 
for worse, as I will show below. Based on a rich 
psychological literature, brain imaging studies in 
humans have sought to describe and character- 
ize the influence of cognitive factors on the neu- 
ral processing and perception of pain since the 
1980s. I will first give an overview of our current 
understanding of mechanisms and neural path- 
ways to cognitive pain modulation and highlight 
the most important recent strands of research 
in this field, with an emphasis on experimental 
studies in healthy individuals. In the second part, 
I will outline how these findings may be inte- 
grated with modern concepts of perception by 
using computational models to explore the in- 
fluence of cognition on pain at a more funda- 
mental level. 


The search for a “signature of pain 
in the brain” 


A modulatory influence of cognitive factors on 
the perception of pain has been documented 
for a number of processes including attention, 
anticipation, catastrophizing, (re-)appraisal, and 
perceived control over pain (1). Undoubtedly, the
most impressive and most extensively studied
example is the placebo analgesic response. Patients
with agonizing levels of pain can report com- 
plete pain relief after administration of a sugar 
pill they think is a powerful painkiller. But do such 
changes in pain reports indeed reflect a change
in pain perception and neural pain processing— 
or just a report bias? Answering this question is 
notoriously difficult given the highly subjective 
nature of pain. Attempts to identify an objective 
readout of pain based on brain imaging data 
remain controversial (2). Brain activation
induced by noxious input might in fact not be 
pain-specific but also reflect processes that are 
(inherently) linked to pain, such as the detec- 
tion of salient events (3). Historically, the network 
of brain regions involved in pain processing 
(“pain matrix”) has been divided into sensory- 
discriminative and cognitive-affective systems. 
The sensory-discriminative system, which in- 
cludes the lateral thalamus and primary and 
secondary somatosensory cortex (SI and SII, re- 
spectively), was thought to process nociceptive 
input, including its intensity, localization, and 
quality. In contrast, the cognitive-affective sys- 
tem, comprising regions such as the anterior 
insula and anterior cingulate cortex, was im- 
plicated in psychological aspects of pain. How- 
ever, this strict dichotomy turned out to be an 
oversimplification, as sensory-discriminative brain 
regions, for instance, can also be sensitive to 
cognitive processes. Moreover, studies using a 
decoding approach (i.e., the prediction of pain 
perception based on activation patterns in the 
brain) demonstrated that the prediction is significantly
improved when the different brain regions
are considered together. In the most rigorous
attempt so far to characterize the “neurological
pain signature” (NPS), Wager and colleagues
(4) used a machine-learning algorithm to predict 
the perceived intensity of experimentally induced 
heat pain in healthy volunteers. The identified 
network—comprising brain regions such as thal- 
amus, SI, SII, anterior insula, and anterior cingulate 
cortex—afforded a specificity of about 90% in dis- 
criminating physical pain from related phenome- 
na (e.g., “pain” due to social exclusion). Although 
these first findings are encouraging, further 
validation of this “objective marker” is needed,
including its translation to clinical pain and 
generalizability to different types of pain and 
pain modulation. 


The descending pain control system: 
Top-down modulation of pain 


Neuroimaging studies have not only been con- 
cerned with target regions of cognitive pain 
modulation but also with areas that implement 
the modulation. The so-called descending pain 
control network comprises regions such as the 
dorsolateral prefrontal cortex (DLPFC), rostral 
anterior cingulate cortex, and periaqueductal 
gray (PAG) (5). Activation and functional con- 
nectivity between these regions are positively 
correlated with the level of pain relief reported. 
The engagement of this modulatory control net- 
work has been linked to reduced activation in 
other pain-related brain regions, albeit with vary- 
ing consistency. Furthermore, the top-down in- 
fluence has been shown to alter responses in 
the spinal dorsal horn, which suggests that it 
can modulate nociceptive processing at an early 
stage (6). The DLPFC is thought to play a piv- 
otal role in top-down control of pain, because 
its transient lesioning by transcranial magnetic 
stimulation (TMS) abolishes placebo analgesia
(7). Most of our knowledge of the pain-control 
system stems from neuroimaging studies on 
placebo analgesia, but this system has also been 
found to be engaged during other types of cog- 
nitive operations leading to pain reduction [e.g., 
distraction (8)]. Descending pain inhibition is 
largely mediated through endogenous opioids 
(9). There is, however, evidence for the contribution
of other neurotransmitters, including cannabinoids
(10) and dopamine (11). Taken together,
research on the descending pain control system 
has described a network that is sensitive to cog- 
nitive manipulations and can interact with other 
brain regions involved in pain processing. 


The frontostriatal system: Valuation of 
nociceptive input and higher-order 
integration of different aspects of pain 


Do changes in perception based on cognitive 
modulation differ from those induced by changes 
in noxious input? Woo and colleagues (12) direct- 
ly compared the modulation of pain through 
different intensity levels of heat and through 
cognitive self-regulation of pain in the same in- 
dividuals. Although the former was indeed re- 
flected in changes in the NPS, self-regulation had 
no effect on the NPS but was associated with 
changes in functional connectivity (i.e., the cross- 
talk between brain regions) of mesolimbic brain 
structures, including the ventromedial prefrontal 
cortex (vmPFC) and nucleus accumbens (NAc). 
This finding is remarkable for two reasons. First, 
it challenges the concept of the NPS as a universal 
signature of pain in the brain. If the NPS is to be 
established as an objective readout of pain, it is 
expected to reflect changes in pain irrespective 
of the type of modulation that led to the change 
in perception. Second, it highlights the contri- 
bution of the mesolimbic network that has been 

Fig. 1. Influence of expectations on pain.

Diffusion decision model. Accumulator model within the framework of perception as inference to describe the decision in a binary decision-making
task. The decision can be understood as an accumulation of evidence that initiates a response as soon as the
decision threshold for one of the options is reached. Without prior information, the probability of both alternatives is
identical (thin center line). If high-intensity pain is expected, this prior information could lead to altered sensory
processing (dashed line) or a shift in prior toward the high-pain boundary (dotted line).

Influence of prior information on perceptual decision-making in the context of pain (30).
Participants saw one of three visual cues on each trial, and these signaled different
probabilities for low-intensity and high-intensity pain: (i) 80% high, 20% low; (ii) 20% high,
80% low; or (iii) 50% high, 50% low. Participants had to indicate as quickly as
possible whether they received a high- or low-intensity stimulus. Response time and
accuracies were recorded.

implicated in learning and valuation
rather than in pain processing as such. 
It could therefore be speculated that 
this network translates sensory, cog- 
nitive, and affective aspects of pain 
into a “common currency” to inte- 
grate them and give rise to one uni- 
fied pain experience. In a longitudinal 
study involving brain imaging (13), 
functional and structural character- 
istics of vmPFC and NAc have been 
shown to predict the development of 
chronic pain. On the basis of these 
findings, it has been postulated that 
the frontostriatal system is key not 
only for the conversion of nociception 
into the perception of pain but also 
for the transition from acute to chronic
pain (14, 15).




Attention and the influence of 
spontaneous brain activity on 
pain perception 


Changes in pain perception have not 
only been observed following delib- 
erate cognitive operations but can 
also occur spontaneously (16). Re- 
cent work linked this finding to spon- 
taneous fluctuations in attention to 
pain that depend on dynamic changes 
in resting state activity in three dis- 
tinct brain circuitries (17): (i) the “sa- 
lience network,” which is involved in the detection 
of biologically relevant stimuli and events and 
comprises brain regions such as mid-cingulate 
cortex, anterior insula, temporoparietal junction, 
and DLPFC (18); (ii) the default mode network
(DMN), which shows a reduced signal level dur- 
ing activity compared with a relaxed nontask 
state and includes regions such as the medial 
prefrontal cortex, the posterior cingulate cortex 
and precuneus, the lateral posterior lobe, and 
the medial temporal lobe (19); and (iii) the de- 
scending pain control system described above. 
Using an experience sampling approach in which 
participants indicated the extent to which they had
paid attention to pain, activation in the salience 
network was found when attention spontane- 
ously focused on pain (20). In contrast, the DMN 
was engaged when attention was focused away 
from pain (20). “Individuals’ intrinsic attention 
to pain” (defined by the test-retest reproduci- 
bility of an individual’s tendency to attend away 
from pain) was related to their structural and 
functional connectivity between DMN and the 
descending pain control system [and the PAG 
in particular (18)]. Also, alterations in the interplay
between the salience network, DMN, and
descending pain control network have been 
related to heightened attention to pain in chronic
pain patients (17). Although speculative at
this time point, research into the neural basis 
of altered spontaneous focusing on pain might 
therefore also be relevant for understanding 
the “interruptive function of pain” (22) on con- 
comitant cognitive processes in clinical pain 
populations. 

The need for a comprehensive
unified framework of cognitive 
pain modulation 


The studies portrayed above have provided im- 
portant insights into the implementation of cog- 
nitive pain modulation. They leave unanswered, 
however, critical questions regarding its initiation, 
maintenance, and integration. The processes 
described in the previous sections need to be 
carefully orchestrated to integrate momentary 
demands with long-term goals (22) to ensure 
survival of the organism. What triggers cogni- 
tive pain modulation and what prevents or stops 
it? Furthermore, the actual interface between 
cognitive processes and pain as a perceptual ex- 
perience has only insufficiently been described 
so far. How are cognitive influences woven into 
the perceptual process that gives rise to the ex- 
perience of pain? 


The construction of a pain experience: 
“Perception as inference” 


Modern concepts of perception outside the pain 
field have begun to address these questions by 
using computational models. In computational 
modeling, measurable indices of behavior that 
result from the inferential process (e.g., categorization
of stimuli as painful versus nonpainful,
response times) are used to inform a theoretical
model, which maps sensory input (e.g., nox- 
ious stimulation) onto behavior. Indicators of this 
mapping are subsequently used in the analysis 
of functional neuroimaging data to characterize 
the inferential process. In the most prominent 
theoretical framework of this lineage, termed 




Fig. 2. Prediction error processing and learning in the context of pain: 
A schematic overview. Sensory input or pain-related cues trigger a pain- 
related expectation. Subsequently, nociceptive input is compared with the 
expectation that reflects prior information. If incoming information is in line 
with prior assumptions, the expectation is confirmed. If they diverge, a pre- 
diction error signal is generated, and the expectation is updated through 
a learning rule. Note that the generation of a prediction error might not 
necessarily lead to a revision of the expectation; following up on prediction 
errors might selectively be impaired in a pathological state and might 
contribute to aberrant learning in the context of pain. 


“predictive coding,” perception is 
conceptualized as an inferential pro- 
cess in which prior information is 
used to generate expectations about 
future perception and to interpret sen- 
sory input (23). During the perceptual 
process, incoming sensory informa- 
tion is compared against a “template” 
or expectation that reflects prior in- 
formation. The concept of predictive 
coding acknowledges that we are 
more likely to perceive sensory in- 
formation in accordance with our 
template than with competing inter- 
pretations. Perception is thereby un- 
derstood as a process that favors 
expected outcomes and weighs down 
information that is incongruent with 
the expectation. Evidence for such 
biased perception in the context of 
pain comes from studies using ex- 
plicit cues to signal the intensity or 
onset of an upcoming noxious stim- 
ulus (24, 25) (Fig. 1) or, in more com- 
plex paradigms, the predictability or 
controllability of the stimulation [e.g., 
(26)]. Moreover, the generation of ex- 
pectations is a shared feature of most 
cognitive processes that have been 
related to pain modulation, despite 
their many differences (27). Biased 
perceptual inference has recently been 
postulated to contribute to various diseases, in- 
cluding functional motor and sensory syndromes 
and psychiatric disorders (28). 

Studies in animals and humans have begun to 
unravel neural mechanisms underlying the infer- 
ential process and to characterize the influence 
of prior information and expectations (29). Col- 
lectively, they show that expectations can bias 
perception by introducing changes not only in 
sensory brain regions but also in those involved 
in interpreting the incoming information. This 
concept extends the traditional view that a cog- 
nitive influence has to be implemented in a top- 
down fashion (as, for instance, reflected in the 
concept of the descending pain control system) 
by emphasizing the relevance of higher-order cor- 
tical processes that translate incoming informa- 
tion into perception. In line with this notion, it 
was recently shown that biased decision-making 
can explain the influence of prior expectations 
on the perception of pain (30). In sum, the con- 
cept of “perception as inference” allows for the 
integration of cognitive factors into the percep- 
tual process itself and highlights the relevance 
of expectations for perception formation. 
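
The two routes by which a prior could bias a decision in the accumulator model of Fig. 1 can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the model fitted in (30): the function names, the unit thresholds, the noise level, and the time step are all hypothetical choices made for illustration. A shifted starting point stands in for a prior that moves toward the high-pain boundary, and a nonzero drift stands in for altered sensory processing.

```python
import random


def simulate_trial(drift, start, rng, threshold=1.0, noise=1.0, dt=0.002):
    """Accumulate noisy evidence until one boundary is crossed.

    +threshold is read as "high pain", -threshold as "low pain".
    All parameter values are illustrative assumptions.
    """
    evidence = start
    elapsed = 0.0
    while abs(evidence) < threshold:
        # Euler step of a drift-diffusion process
        evidence += drift * dt + noise * (dt ** 0.5) * rng.gauss(0.0, 1.0)
        elapsed += dt
    choice = "high" if evidence > 0 else "low"
    return choice, elapsed


def proportion_high(drift, start, n_trials=1000, seed=0):
    """Fraction of simulated trials that end at the high-pain boundary."""
    rng = random.Random(seed)
    n_high = 0
    for _ in range(n_trials):
        choice, _rt = simulate_trial(drift, start, rng)
        if choice == "high":
            n_high += 1
    return n_high / n_trials


if __name__ == "__main__":
    print("no prior information:   ", proportion_high(drift=0.0, start=0.0))
    print("shifted starting point: ", proportion_high(drift=0.0, start=0.5))
    print("altered drift:          ", proportion_high(drift=1.0, start=0.0))
```

Both manipulations raise the share of "high pain" decisions in this toy setting. Telling them apart empirically would depend on the joint pattern of choices and response times, which is why both were recorded in the task described in Fig. 1.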




Learning and updating internal models 
about pain 


If the influence of expectations on perception 
is so profound, why do we not simply foster the 
most extreme expectations of radical pain relief 
as part of any pain treatment? Although the 
influence of expectations is undoubtedly strong, 
there are clearly limits to the extent to which expectations
can influence the perception of sensory signals
(31). Our representations of reality should first
and foremost enable us to successfully navigate 
the environment with minimal costs, which ren- 
ders delusional ideations impractical. A substantial 
deviation of our expectations from reality should 
therefore lead to course correction—an updat- 
ing or revision of our expectations. Predictive 
coding and learning models rooted in this ap- 
proach assume that when expected and observed 
sensory information diverge, a “prediction error 
message” is generated in the brain that serves 
as a teaching signal for model updating (Fig. 2). 
So far, prediction error (PE) signaling has been 
studied extensively in the context of reward lear- 
ning and perceptual decision-making in the vis- 
ual and auditory systems. However, a limited 
number of studies have explored aversive PE pro- 
cessing by using noxious stimuli as an aversive 
experience during learning in humans (32-35). 
But what constitutes an important enough 
deviation to challenge our current model and 




trigger updating? Both premature and delayed 
model updating can be risky. Premature or overly 
frequent updating might lead to a highly volatile 
and unreliable model and also absorb large amounts 
of attentional resources for constant monitoring. 
Conversely, delayed model updating might bear 
the risk that true changes are not considered early 
enough to prevent costly erroneous decisions. In- 
dividuals may optimally integrate newly available 
information into their existing model when short- 
lasting noxious stimuli are applied under controlled 
experimental conditions. There is, however, ample 
evidence that information integration and learning 
are suboptimal in patients suffering from chronic 
pain (36), and lead to change-resistant mental 
representations of pain and delayed updating. 
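
The trade-off between premature and delayed updating can be made concrete with a simple error-driven learning rule. The sketch below assumes a delta rule of the Rescorla-Wagner type, in which the expectation moves toward the observation by a fixed fraction (the learning rate) of the prediction error. The text does not commit to this particular rule, so the rule itself, the function names, and the numbers are illustrative assumptions.

```python
def update_expectation(expectation, observed, learning_rate):
    """One step of an assumed delta rule.

    Returns the revised expectation and the prediction error that drove it.
    """
    prediction_error = observed - expectation
    revised = expectation + learning_rate * prediction_error
    return revised, prediction_error


def track(observations, learning_rate, initial):
    """Apply the rule to a sequence of observations and return each revised expectation."""
    expectation = initial
    history = []
    for observed in observations:
        expectation, _pe = update_expectation(expectation, observed, learning_rate)
        history.append(expectation)
    return history


if __name__ == "__main__":
    # Hypothetical case: pain was expected at 8/10, but five stimuli are felt at 2/10.
    felt = [2.0] * 5
    print("fast updating (rate 0.9):", track(felt, 0.9, initial=8.0))
    print("slow updating (rate 0.1):", track(felt, 0.1, initial=8.0))
```

A high learning rate lets a single observation almost overwrite the expectation, which corresponds to the volatile model described above. A low learning rate leaves the expectation close to its starting value after several contradicting observations, which corresponds to delayed updating.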
Premature updating of expectations, on the 
other hand, might be particularly detrimental 
when the maintenance of positive expectations 
could aid treatment success and their (premature) 
downward corrections might compromise the 
outcome. As with placebo analgesia, the experi- 
ence of insufficient pain relief during treatment 
might be irreconcilable with the expectation of 
a successful treatment—and, therefore, lead to 
model updating (37). Treatment expectations 
are—if at all—often only assessed once before 
treatment onset, but it seems reasonable to 
assume that expectations are modified if the 
treatment effect falls short of pretreatment 
expectations. This downward adjustment may 
not only cause patients to drop out of treatment 
programs but might directly prejudice treatment 
success, because the inferential processes of pain 
perception and treatment judgment are no longer
supported by a positive expectation of pain relief. 
Critically, negative treatment experiences have a 
prolonged effect that can also hamper subsequent 
unrelated treatment (38). In sum, learning mod- 
els characterize the modification of expectations 
when new information becomes available and 
could be applied to explore aberrant learning that 
is frequently found in the context of chronic pain. 


Conclusions and outlook 


With the departure from a rather rigid concept 
of pain as a direct readout of sensory input, 
neuroimaging of cognitive pain modulation has 
provided valuable insights into the complex na- 
ture of pain and its neural basis. However, fur- 
ther research is needed to integrate the various 
efforts into a coherent model that addresses all 
aspects of modulation, including its initiation, im- 
plementation, and monitoring, and also includes 
the existing behavioral and neuroimaging litera- 
ture (39). The concept of predictive coding could 
provide the theoretical framework for this endeavor.
For example, Büchel and colleagues (37)
pointed out that although the descending pain- 
control system is commonly interpreted as a top- 
down influence, its constituting brain regions 
have reciprocal connections, which allow for up- 
and downward projections of information. They 
therefore suggest that—in line with the concept 
of predictive coding—the descending system 
could be part of a larger recurrent network ex- 
changing PE signals at all levels of the neuraxis. 
Several findings point at a critical role of the 
DLPFC in orchestrating this network. As described 
above, the DLPFC is part of various networks that 
are involved in cognitive pain modulation. It plays 
a key role in evidence accumulation during per- 
ceptual decision-making, as described in the con- 
text of perception as inference (40) and learning. 
The integration of these different strands of re- 
search with respect to DLPFC functioning and 
its governance of learning networks can be ex- 
pected to provide the much-needed unifying 
model of neural mechanisms underlying cog- 
nitive pain modulation. 

The concept of pain as an actively constructed 
experience that is determined by expectations 
and their modification through learning has far- 
reaching implications for pain treatment and 
prevention. Treatment success is known to
depend critically on patients’ expectations,
not only in the context of placebos but also with 
active interventions such as analgesic drugs (41). 
Expectations—in turn—are shaped by the infor- 
mation that is provided by health care practi- 
tioners. How could information be designed to 
optimally guide expectations for maximum treat- 
ment outcome? How could aberrant information 
processing be addressed using the framework of 
predictive coding? Future research should explore 
the translation of research on the inferential 
process underlying the perception of pain into 
clinical practice to optimally inform pain pre- 
vention and treatment strategies. 

Patients’ complaints about pain that persists 
despite numerous treatment attempts are often
dismissed as being “all in their head.” Modern
pain research has shown that this notion is in 
fact true for any kind of pain, acute or chronic, 
easy to treat or resistant to all treatments cur- 
rently available. We are only beginning to under- 
stand that the head (or the brain, for that matter) 
also holds the key to new ways to help patients 
conquer their pain. 
SPECIAL SECTION


PAIN RESEARCH 


REVIEW 


Exploring pain pathophysiology
in patients


Claudia Sommer* 


Although animal models of pain have provided invaluable information on basic processes
underlying pain pathophysiology, translation to humans is a problem. This Review will 
summarize what information has been gained by the direct study of patients with chronic 
pain. The techniques discussed range from patient phenotyping using quantitative 
sensory testing to specialized nociceptor neurophysiology, imaging methods of peripheral 
nociceptors, analyses of body fluids, genetics and epigenetics, and the generation of 
sensory neurons from patients via inducible pluripotent stem cells. 


When we speak of pain, we understand it
as a deeply human experience, because
it involves not only nociception and the
immediate physiological reactions to
it, but also emotional, cognitive, and social
consequences. Therefore, the ideal subject
for the study of pain is the human being, and 
the best animal experiments or in vitro studies 
can only give information about partial aspects 
of the phenomenon of pain. However, pain re- 
search in humans, and even more so in patients 
with pain, is greatly restricted, for a number of 
reasons. Despite ever-improving imaging and 
neurophysiological methods, our access to the 
human organism is limited, and ethical restric- 
tions apply to all manipulations involving volun- 
teers or patients. Human beings are exposed to a 
myriad of influences, both over a lifetime and 
at any given moment, which makes standard- 
ization much more difficult than in an inbred 
rodent colony. With all these caveats, the study 
of pain pathophysiology in humans has made 
great progress in recent years. Some of these 
findings, including work by the author’s group, 
will be highlighted in this Review with a focus 
on neuropathic and other chronic pain and on 
such findings that have already led to a change 
in patient management or are likely to do so. 


Sensory phenotyping—not all pain is alike 


Neuropathic pain is defined as pain due to a 
lesion or disease of the somatosensory system 
(1). It has been hypothesized that the pattern of 
sensory loss and gain caused by such a lesion 
might reflect the underlying pathophysiology 
and give guidance toward individualized ther- 
apy. Quantitative sensory testing (QST) is a 
psychophysical method that uses a battery of 
sensory stimuli with predetermined physical 
properties following specific protocols (2, 3). QST 
is thus able to capture and quantify stimulus- 
evoked negative and positive sensory phenomena 
(Fig. 1). Physical stimuli are transduced by ac- 
tivating specific receptors, and resulting phys- 
iological signals are then conducted in specialized 
peripheral nerve fibers and central pathways.
The nerve fibers involved have unique physiologic 
properties, such that a sensory deficit or hyperphe- 
nomenon detected by QST should allow con- 
clusions as to which type of nerve fibers or their 
central connections are affected. 

QST has uncovered involvement of small nerve 
fibers (i.e., nociceptors) in a number of disor- 
ders where it was unexpected, such as Parkinson’s 
disease, neuropathies that so far have been con- 
sidered to be of the large-fiber type only, and 
fibromyalgia syndrome (4, 5). It has been in- 
ferred from such findings that primary or sec- 
ondary peripheral nociceptor involvement may 
be part of the pathophysiology of pain in these 
disorders. In diseases where nociceptor mal- 
function was already known to play a role, QST 
has helped to identify patterns—i.e., sensory 
profiles—that may give clues toward the individ- 


ual pathophysiology of pain and the potential 
treatment response. For example, the antiepileptic 
oxcarbazepine was more effective in reducing 
neuropathic pain in patients with preserved (and 
potentially hyperactive) nociceptor function, ac- 
cording to QST, than in patients with loss of 
nociceptor function (6). By contrast, a topical 
lidocaine patch, expected to work well if it finds 
active nociceptors to work on, was equally effi- 
cient in nociceptor-deprived skin (7, 8), whereas 
one recent study came to the opposite conclu- 
sion that lidocaine was more efficacious in skin 
with the QST features of overactive nociceptors 
(9). Another topical treatment, the 8% capsa- 
icin patch, seems to work better in patients with 
QST signs of hyperalgesia, whereas indicators 
for small-nerve fiber loss had no predictive value 
(10). In a recent analysis of 361 patient records 
from trials with antidepressants and anticonvul- 
sants, the usefulness of sensory phenotyping by 
QST for decisions on individualized treatment was 
found to be very limited (11). Many researchers
and clinicians have adopted the QST protocol de- 
scribed by the German Research Network on Neu- 
ropathic Pain (DFNS), and this standardization has 
been of great value for multinational studies. 
Others have argued that by focusing on thresh- 
olds, the dynamic stimulus-effect relationship and 
suprathreshold-nociceptive processing are neglected 
in this protocol. Therefore, the issue of whether 
QST sensory threshold phenotyping helps guide 
individual treatment is still open and will be re- 
analyzed with more selective treatments and more 
comprehensive patient phenotyping. To understand 
pain pathophysiology, QST data are particular- 
ly valuable when put into relation with neu- 
rophysiological and molecular phenotyping. 


Patients with painful polyneuropathy 
Fig. 1. A hypothetical example of two types of sensory profiles (clusters) typically found when
analyzing QST data from large groups of patients (n = 150 in each group) with painful polyneuro- 
pathies. Positive z scores indicate positive sensory signs (hyperalgesia), negative z values indicate 
negative sensory signs (hypoesthesia, hypoalgesia). The shaded area indicates the 95% confidence 
interval for healthy subjects (-1.96 < z < +1.96). Red symbols indicate a cluster with mild sensory loss 
and gain in the mechanical parameters; blue symbols indicate a cluster with sensory loss. WDT, warm 
detection threshold; CDT, cold detection threshold; TSL, detection of temperature changes; CPT, cold 
pain threshold; HPT, heat pain threshold; PPT, pressure pain threshold; MPT, mechanical pain thresh- 
old to pinprick; MPS, mechanical pain sensitivity; WUR, wind-up ratio; MDT, mechanical detection 
threshold to von Frey hairs; VDT, vibration detection threshold. 
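
The z scores in such a sensory profile express how far a patient's value lies from a healthy reference group. A minimal sketch of that calculation follows, assuming a reference mean and standard deviation are available for each parameter. The reference numbers, the `Norm` type, and the sign flip for threshold measures (so that a gain of function always comes out positive) are illustrative assumptions. The sketch also ignores any data transformation a real protocol may apply before standardizing.

```python
from dataclasses import dataclass


@dataclass
class Norm:
    """Hypothetical reference data for one QST parameter."""
    mean: float
    sd: float
    lower_means_gain: bool  # True if a lower raw value indicates higher sensitivity


def z_score(value, norm):
    """Standardize a raw value so that positive z indicates gain of function."""
    z = (value - norm.mean) / norm.sd
    return -z if norm.lower_means_gain else z


def classify(z, limit=1.96):
    """Compare z with the 95% range of healthy subjects (-1.96 < z < +1.96)."""
    if z > limit:
        return "gain"
    if z < -limit:
        return "loss"
    return "within normal range"


if __name__ == "__main__":
    # Invented reference values for a heat pain threshold in degrees Celsius.
    hpt_norm = Norm(mean=44.0, sd=2.0, lower_means_gain=True)
    for raw in (38.0, 44.5, 49.0):
        z = z_score(raw, hpt_norm)
        print(raw, round(z, 2), classify(z))
```

Plotting one such z score per parameter for a patient or a cluster of patients gives a profile of the kind shown in the figure.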

Neurophysiology—nociceptor function
can be captured, if properly done 
Classical sensory neurophysiology measures large 
nerve fibers and their afferent connections. This 
is true for standard nerve conduction studies 
and somatosensory evoked potentials. Special 
techniques are available to stimulate and record 
the activity of small nerve fibers and their af- 
ferent connections. In general, a pathological 
finding concerning evoked potentials does not 
provide information about where in the course 
from periphery to central nervous system (CNS) 
the lesion is located (Fig. 2). Among the neurophysiological 
stimulation techniques selectively activating Aδ and C fibers, 
the recording of laser-evoked potentials (LEPs) has the longest 
history and is best established (12, 13). A contact-free 
20-ms radiant-heat pulse is applied, which is 
absorbed by the upper 20 to 50 μm of the skin. 
Some of the recent contributions of LEP studies to 
the pathophysiology of pain include support for 
spinothalamic pathway damage as a predictor 
for pain in patients with multiple sclerosis (14) 
and after stroke (15). 

Contact heat-evoked potentials (CHEPs) are a 
method for which normative data have recently 
been published (16, 17). CHEPs reflect Aδ activity, 
but can be used to analyze C-fiber function using 
special techniques (18). It appears that CHEP 
amplitudes may reflect the reported magnitude of 
evoked pain (17). The “pain-related evoked potentials” 
(PREPs) are electrically evoked Aδ fiber 
potentials (19, 20) that correlate with skin innervation 
in homogeneous populations (9). 

In even more sophisticated setups, neuro- 
physiology combined with psychophysics has 
been used to uncover pain mechanisms—for 
example, using the concept of conditioned pain 
modulation (21). Here, a protocol testing temporal 
summation and conditioned pain modulation 
was used. For temporal summation testing, a 
train of stimuli of constant intensity is admin- 
istered, and a typical increase in pain ratings by 
the patient along the series is expected. For con- 
ditioned pain modulation, heat pain ratings were 
taken from one arm, while the other arm was 
immersed in a hot water bath. Patients exhibiting 
increased ratings in the temporal summation pro- 
tocol and deficient conditioned pain modulation 
are thought to lack endogenous pain 
inhibition, thus having a “pro-nociceptive” pattern 
of pain modulation (21). Interestingly, those 
patients who had deficient conditioned pain 
modulation responded best to duloxetine, a 
serotonin-noradrenaline reuptake inhibitor 
assumed to work by strengthening descending 
pain inhibition. Not surprisingly, patients with 
fibromyalgia syndrome had greatly impaired 
conditioned pain modulation (22). Thus, such 
complex neurophysiological tests may be valu- 
able tools for assessing pain pathophysiology 
and potential response to treatment. 
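
The two measures described in this paragraph can be sketched numerically. The example below assumes that temporal summation is summarized as the ratio of the mean rating during the stimulus train to the mean rating of single stimuli, and that conditioned pain modulation is summarized as the change in the test-stimulus rating while the conditioning stimulus is applied. Both summaries, and all names and values, are illustrative assumptions rather than the protocol of the cited study.

```python
from statistics import mean


def temporal_summation_ratio(train_ratings, single_ratings):
    """Ratio of mean pain rating during a stimulus train to the mean
    rating of single stimuli of the same intensity (assumed summary)."""
    baseline = mean(single_ratings)
    if baseline == 0:
        raise ValueError("single-stimulus ratings must not average zero")
    return mean(train_ratings) / baseline


def cpm_effect(rating_alone, rating_during_conditioning):
    """Change in the test-stimulus rating under conditioning.

    Negative values indicate pain inhibition; values at or above zero
    indicate deficient conditioned pain modulation (assumed convention).
    """
    return rating_during_conditioning - rating_alone


# Hypothetical ratings on a 0-10 scale (train) and a 0-100 scale (heat pain).
ts_ratio = temporal_summation_ratio([6, 7, 8], [3, 4, 2])
cpm = cpm_effect(rating_alone=60, rating_during_conditioning=45)
```

In this made-up case the ratings rise along the train (ratio above 1) while the conditioning stimulus lowers the test rating by 15 points, so pain inhibition would count as intact. A "pro-nociceptive" pattern in the sense of the text would combine a high ratio with a conditioning effect near or above zero.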

Microneurography uses a very different approach. Here, a 
microelectrode is directly inserted into a nerve fascicle, in 
close contact to the axons, and spontaneous or evoked nerve 
discharges are recorded, such that the individual firing behavior 
of C fibers can be assessed. Thus, whereas an abnormality found in 
any of the evoked-potential techniques may arise from anywhere in 
the neuraxis, an abnormality found in microneurography arises from 
the studied axon on site. Using microneurography and advanced 
techniques such as threshold tracking in humans, researchers have 
recently shown spontaneous activity in nociceptors in diabetic and 
other painful neuropathies (23, 24). Spontaneous activity of C 
fibers was also shown in fibromyalgia syndrome (25), a disorder 
that was generally thought to be a CNS disorder. Moreover, the 
pathological C-fiber patterns (“slowing”) were different between 
fibromyalgia and small-fiber neuropathy (SFN), indicating distinct 
pathophysiological processes behind these disorders, both of which 
are associated with reduced skin innervation by C fibers (see below). 

Fig. 2. Schematic illustrating the information potentially gained from neurophysiological and 
skin biopsy studies in humans. When stimulating in the periphery and recording from the CNS, 
there is information about the entire course of the somatosensory system. Any abnormality detected 
cannot be localized to a particular neuroanatomic level. 
[Figure 2 panel labels: fMRI, evoked potentials, QST (information from the whole neuraxis); fMRI, evoked potentials (information from the periphery to spinal cord tracts); brainstem reflexes: blink reflex, masseter inhibitory reflex, jaw jerk (information on brainstem excitatory and inhibitory circuits); microneurography, nerve conduction studies (information from the peripheral nerves); skin biopsy (information on local processes in skin). Anatomical labels: dorsal spinal cord, dorsal root, nerve fascicle, epidermis, dermis, Aδ fiber, C fiber.] 

Fig. 3. microRNAs (miRNAs) as an example of a “master switch” that could regulate numerous pain-related pathways at once. 


Nociceptor histology and noninvasive 
nociceptor imaging—imaging pain 
in the periphery 

The experience of pain needs a brain, and thus 
brain imaging with its different modern techniques, 
such as brain and spinal cord functional 
magnetic resonance imaging, voxel-based morphometry, 
quantitative arterial spin labeling, 
near-infrared spectroscopy (NIRS), and advanced 
positron emission tomography, provides valuable 
information. These techniques are beyond the 
scope of this article and will in part be covered 
in an accompanying article (26). Here, I will sum- 
marize the methods that allow the very distal 
parts of the nociceptors to be visualized. 

The analysis of nociceptors in skin punch biop- 
sies has moved the field forward considerably. 
Like functional aspects of skin innervation—as, 
for example, assessed by QST—morphological 
quantification of skin innervation has been used 
to find predictors for drug responses. Local in- 
jections of botulinum toxin were more effective 
in patients with neuropathic pain with preserved 
skin innervation than in those with marked de- 
nervation (27). 

The whole complex of SFN, an often painful 
neuropathy with reduced temperature sensitiv- 
ity and burning feet but lack of classical neuro- 
pathy signs, could only be defined after these 
patients had been investigated with intraepi- 
dermal nerve fiber (IENF) quantification in skin 
biopsies (28). IENF quantification has helped 
to define SFN (29), to identify length-dependent and 
non-length-dependent subgroups, and to point 
toward a cytokine-related pathogenesis of the 
pain in the latter (30). Small-fiber pathology in 
the skin has also been detected in disorders 
where it was not expected, thus extending the 
ideas on their pathophysiology. Examples in- 
clude complex regional pain syndrome type I 
(CRPS I) (30), peripheral arterial disease (32), 
and fibromyalgia syndrome (5). The finding of 
decreased skin innervation in a subset of pa- 
tients with fibromyalgia is of particular inter- 
est, for several reasons. First, for decades, this 
syndrome had been regarded as a chronic pain 
disorder based on faulty pain processing in the 
CNS (33), or as a somatoform disorder (i.e., not 
based on organic disease at all). Second, the 
initial finding was rapidly replicated by other 
groups (34, 35) and supported by evidence for 
functional changes using QST and evoked potentials 
(5, 35), and in particular microneurography 
(25). Third, the finding led to a lively 
discussion in the scientific community over whether 
all disorders showing a reduced IENF density 
should be called “small-fiber neuropathy.” 




Although this notion sounds attractive, there 
are some important counterarguments. The neuro- 
physiological characteristics of remaining, viable 
nociceptors have been shown to be different 
between the different disorders (25). The abnor- 
mally high slowing of C nociceptors in conduc- 
tion velocity when stimulated at 0.25 Hz was 
only present in fibromyalgia, not in SFN, indi- 
cating different mechanisms underyling the loss 
of nerve fibers in both conditions. This phenom- 
enon might be due to a thinner diameter of these 
axons (36). Showing a peripheral nervous system 
pathology in fibromyalgia patients, however, does 
not preclude a CNS component of the disorder, as 
postulated by others (33). Indeed, our own find- 
ings using NIRS to assess brain activity in fibro- 
myalgia patients upon painful stimuli point 
toward an additional central amplification (37). 

Although skin biopsy as a method to image 
nociceptors is minimally invasive, other, even 
less invasive methods have been developed. One 
of them is corneal confocal microscopy (CCM), 
which images nociceptors in the cornea with 
high resolution in real time and allows their quan- 
tification. The density of corneal innervation ap- 
pears to correlate with skin innervation at the 
leg (where it is usually assessed in patients with 
painful neuropathies) (38). In diabetic neuro- 
pathy, where CCM has been most extensively 
used, CCM is considered valuable for detecting 
and assessing early nerve damage (39). How far 
this can be generalized and what else can be 
learned from CCM in particular for the study of 
pain remain to be seen. In fibromyalgia, as in skin 
biopsy studies, small-fiber pathology was de- 
tected in about 50% of patients by CCM (40). Dif- 
ferent subgroups of fibromyalgia patients could 
be identified with a combination of CCM param- 
eters and sensory testing, which may lead to more 
personally tailored and efficient treatment. 


Pain mediators in body fluids and 
tissue—reviving the “inflammatory soup”? 


Chemosensitive or sensitized nociceptors in- 
crease their activity upon stimulation with the 
appropriate chemical or combination of chemicals, 
once termed “inflammatory soup” (41). The 
increase of neuronal activity in in vitro preparations 
and in experimental animals after application 
of algesic mediators such as prostaglandins 
and proinflammatory cytokines is well estab- 
lished. In humans, one of the questions is, which 
body fluid or tissue should be investigated to 
correlate levels of such mediators with measures 
of pain? Then, if such a correlation is found, there 
remains the question of whether this is a causal 
relationship or a chance finding. Longitudinal 
studies and, in particular, studies with inter- 
ventions directed at the mediator in question 
may provide evidence for one or the other. Having 
pursued the subject of inflammatory mediators 
in patients with neuropathic pain over many 
years, our group could show such correlations 
in some but not in all situations tested. A cohort 
of patients with chronic widespread pain had a 
systemic proinflammatory cytokine profile with 
reduced blood concentrations of anti-inflammatory 
cytokines (42). Local cytokine expression in the 
skin was not different in fibromyalgia compared 
to controls (43). Unexpectedly, in patients with 
postherpetic neuralgia, systemic cytokine profiles 
were not different from those of controls, and lo- 
cal cytokine concentrations in affected skin were 
not different from those on the unaffected side 
(44). Systemic elevations of proinflammatory cyto- 
kines were found in patients with complex 
regional pain syndromes (CRPS) (45) and with 
painful (but not in painless) peripheral neuro- 
pathy (46). In length-dependent SFN, a biopsy 
specimen from the distal, painful skin harbored 
an increased expression of proinflammatory cy- 
tokines compared to proximal, nonpainful skin 
(30). Analyzing cytokine profiles in local blister 
fluid from patients with CRPS, we 
found bilaterally increased proinflammatory 
tumor necrosis factor-α and macrophage inflammatory 
protein-1β and decreased anti-inflammatory 
interleukin-1 (IL-1) 
receptor antagonist protein concentrations 
compared to non-CRPS patients 
(47). After 6 months of analgesic 
treatment, the increased cytokine 
protein levels in CRPS patients re- 
verted to the level of non-CRPS pa- 
tients. Such findings, similar to those 
by Wang et al. (48), who showed an 
increase in systemic IL-8 in pa- 
tients with fibromyalgia that was 
normalized after a course of multi- 
disciplinary therapy, support an in- 
terpretation of a pathophysiological 
importance. However, comparison 
between studies is hampered by the 
wide range of methods used. Fur- 
thermore, in contrast to data from 
animal studies, these correlations are 
far from providing proof of a causal 
relationship. Intervention studies di- 
rected at the respective mediators 
would be needed to show a causal 
link. Unbiased modern methods like 
proteomics may provide a further 
solution to this problem (49, 50). 
A factor that may have a major im- 
pact on the balance of pro- and anti- 
inflammatory processes is the gut 
microbiome. Bidirectional influences 
between gut microorganisms, the 
cytokine balance, stress and the 
hypothalamic-pituitary-adrenal axis, 
and pain have been postulated (51). 


Pain is in the genes—but what 
manipulates them? 


Large-scale genomics analyses of pa- 
tient samples have thus far been 
disappointing. By contrast, mono- 
genetic pain disorders, such as the 
diseases associated with mutations 
in voltage-gated sodium channels 
(Nav), erythromelalgia, paroxysmal 
extreme pain disorder, congenital 
insensitivity to pain, and painful he- 
reditary SFN, have been immensely 
instructive as models of human pain 
disorders (52). Gain-of-function mutations of 
Nav1.7 ion channels underlie erythromelalgia and 
paroxysmal extreme pain disorder, and a loss-of-function 
mutation in the same gene induces 
congenital analgesia. Although these disorders 
are rare, up to 30% of a cohort of patients with 
idiopathic SFN have mutations in the genes for 
the sodium channels Nav1.7, Nav1.8, and Nav1.9 
(52). Thus, the important role of voltage-gated 
sodium channel activity in nociceptors for pain 
perception has been convincingly shown in hu- 
mans, and detailed information on the malfunction 
of the channel in different mutations has been 
gained (53). 


Fig. 4. Somatic cells from patients can be reprogrammed to neuronal 
or glial cells, which can be used to study pathophysiology or screen 
for drug treatment. 
[Figure 4 panel labels: somatic cells from patients; induced pluripotent stem cells (maintain); neural progenitor cells; neurons; disease modeling.] 

A single-nucleotide polymorphism in the SCN9A 
gene [rs6746030, substitution of arginine-1150 
with tryptophan (R1150W)] that is present in 
~18% of Caucasians (54, 55) may predispose to 
increased pain sensation. Further studies have dem- 
onstrated an association of this polymorphism 
with pain in Parkinson’s disease, with multiple 
regional pain syndromes, and with pain in in- 
terstitial cystitis. Because not every person with 
this polymorphism develops a pain disease, addi- 
tional factors influencing the phenotype have 
been postulated. We described a family with 
exercise-induced pain and the R1150W polymorphism 
that in addition had a distal demyelinating 
sensory neuropathy (56). In 
these patients, as in patients with 
erythromelalgia, a sodium channel- 
blocking drug was an effective 
analgesic. 

Human diseases caused by muta- 
tions in the genes for nerve growth 
factor (NGF) or its tyrosine kinase 
receptor TrkA illustrate the impor- 
tance of NGF for pain perception. 
These rare autosomal recessive muta- 
tions lead to congenital insensitivity 
to pain or decreased pain percep- 
tion (57). The disorders are termed 
hereditary sensory and autonomic 
neuropathy (HSAN) type IV and type 
V. HSAN-IV, also called congenital 
insensitivity to pain with anhidrosis 
(CIPA), is caused by mutations in the 
gene coding for TrkA. In this severe 
congenital disease, the affected child- 
ren do not respond to painful stim- 
uli, which may lead to mutilations, 
Charcot arthropathy, osteomyelitis, 
and autoamputation. HSAN-V is 
caused by mutations in the NGF 
gene. Accordingly, drugs blocking 
NGF have strong analgesic actions 
in clinical trials on osteoarthritic 
and neuropathic pain (58). 

Sometimes a single mutation in a 
pain-relevant gene within one fam- 
ily may lead to quite different pain 
phenotypes (59). Epigenetic modi- 
fication of genes is one factor that 
explains such variability. In animal 
experiments, DNA methylation cor- 
related between brain and periph- 
eral blood cells (60), which encourages 
the use and interpretation of methylation 
in human blood cells as surrogates. 
Indeed, a twin study showed 
a relation between pain sensitivity and methylation of 
the gene for the pain-related ligand-gated ion 
channel TRPA1: higher 
methylation and lower expression of 
the gene were correlated with higher 
pain sensitivity (61). Methylation of 
the promoter domain of the en- 
dothelin 1B receptor gene was shown 
in patients with oral squamous cell 
carcinoma, but only if these were 
painful (62). Many more examples 
of methylation of pain-related genes exist in the 
animal literature, but human studies are lagging 
behind. 

4 NOVEMBER 2016 • VOL 354 ISSUE 6312 591 

With the exception of rare monogenetic dis- 
eases like the ones mentioned above, changes in 
one gene or its product often do not have a major 
effect on pain perception. Thus, researchers have 
been looking for “master switches” that would 
regulate multiple gene products and orchestrate 
multiple pathways, acting on circuitries rather than 
on individual receptors or targets. MicroRNAs have 
the potential to function as such master switches 
(Fig. 3) and have been suggested to link pain and 
inflammation (63). MicroRNA “signatures” have 
been proposed for several pain diseases such as 
neuropathic pain (64) and fibromyalgia (65, 66). 
Again, such human microRNA profiles can pro- 
vide hypotheses, but causal connections can only 
be tested in animal models, until specific drugs 
for human use become available. 


Re-creating patient cells and tissue 


The possibility of creating patient neurons from 
pluripotent stem cells (PSCs) out of patient blood 
cells or tissue cells (for example, skin fibroblasts) 
has opened entirely new horizons for patient- 
related research on pain pathophysiology (67) 
(Fig. 4). Because the induced PSCs (iPSCs) and 
the neurons created from them retain the ge- 
netic identity of the donor patient, these cells 
can be used to study various aspects of disease 
pathophysiology. A striking example is the recent 
report on iPSC-derived sensory neurons from 
patients with inherited erythromelalgia caused 
by a SCN9A mutation (68). These neurons, like 
the patients, had hyperexcitability and aber- 
rant responses to heat stimuli. Furthermore, 
the heat hypersensitivity could be reversed by 
a blocker of the mutated ion channel, Nav1.7. 
The same drug had a clinical effect on the pa- 
tients’ pain symptoms. 

iPSCs have also been created from patients 
with Fabry’s disease (69), a monogenetic disor- 
der with a characteristic pain phenotype, and the 
potential to mature them into sensory neurons 
will help to elucidate the pain mechanism. The 
same option is certainly available for other mono- 
genetic pain disorders, and such neurons will lend 
themselves to high-throughput screening methods 
for modulators of neuropathic pain. The tech- 
nique is still new, and major challenges need to 
be overcome. Apart from the technical issues and 
the question of standardization between labora- 
tories, iPSC-derived neurons will have to display 
a measurable disease-specific phenotype with a 
stable expression across passages in order to be 
useful as a disease model and potential drug- 
screening platform. 

Organotypic cultures, thus far realized using 
human skin cells and rodent nerves to inner- 
vate them (70), may provide a further step 
toward human disease models, preferably when 
constructed entirely out of patient cells. Such 
organotypic cultures can be studied with high- 
throughput microscopy, proteomics, and electro- 
physiology, and even optogenetic methods may 
be applied (72). 


Conclusions—What we know and what 
we would like to know 

Ideally, we would like to understand the under- 
lying pathophysiology of pain in each patient on 
an individual basis. With the exception of the 
rare monogenetic diseases with overactive sodium 
channels, we are still far from this goal. 

Other monogenetic disorders associated with 
typical pain states like hereditary migraines, Fabry 
disease, sickle cell disease, and others, provide 
opportunities to uncover more of these mecha- 
nisms in model diseases. For the major pain 
diseases like osteoarthritis and low back pain, twin 
studies and population studies have shown that 
genetic risk factors can explain some of the 
individual differences in pain perception, but 
that environmental factors are equally impor- 
tant. These studies will continue to provide 
valuable information on pain risk factors. The 
yield from genome-wide association studies may 
improve when more uniform cohorts are included, 
and ethnicity and other variables are more string- 
ently considered. The field of gene-gene and 
gene-environment interactions is just beginning 
to be explored regarding human pain disorders. 
Epigenetic factors include and are influenced by 
diet, exercise, and the social environment. Pro- 
spective cohort studies may give insight into 
these interactions. 

We have tools for assessing sensory phenotypes, 
including pain thresholds and hypersensitivity, 
most of which have been well standardized with 
international agreement. Consequently, using 
these in clinical trials may help to identify re- 
sponder subgroups in trials that would other- 
wise have a negative outcome. In the research 
setting, more information may be obtained from 
individual patients by combining the threshold 
techniques with methods assessing suprathresh- 
old function and central pain processing. Com- 
bining such results with those from genomics and 
proteomics may push the field forward. Imaging 
methods of the brain and spinal cord will com- 
plement these findings. 

Methods for gaining information out of bio- 
materials from patients have improved consider- 
ably, including the generation of neurons out of 
iPSCs, organotypic cultures, and high-throughput 
methods. It can be expected that combined with 
technologies like neurophysiology, proteomics, 
and optogenetics, human in vitro models will 
lead to rapid advances in pain pathophysiology 
and drug response. 


Keeping hearts and blood vessels young 


Activation of the G protein-coupled receptor GPER is thought to confer cardiovascular 
benefits. Unexpectedly, Meyer et al. found that aged mice that were deficient 
in Gper did not develop as much cardiac fibrosis as aged mice in the control group 
and retained greater cardiovascular function. Gper deficiency was associated 
with reduced production of tissue-damaging superoxide in blood vessels and 
the myocardium. A GPER-blocking drug reduced blood pressure and superoxide 
production in hypertensive mice, suggesting that GPER inhibitors could be 
used to treat cardiovascular diseases caused by excessive superoxide 
generation. -WW 


Radicals push proteins 
beyond genes 

Chemically modifying proteins 
after their translation 
can expand their structural 
and functional roles (see the 
Perspective by Hofmann and 
Bode). Two related methods 
describe how to exploit free 
radical chemistry to form 
carbon-carbon bonds between 
amino acid residues and a 
selected functional group. 
Wright et al. added a wide range 
of functional groups to proteins 
containing dehydro-alanine 
precursors, with borohydride 
mediating the radical chemistry. 
Yang et al. employed a similar 
approach, using zinc in com- 
bination with copper ions. 
Together, these results will be 
useful for introducing function- 
alities and labels to a wide range 
of proteins. —NW 

Science, this issue pp. 597 and 623; 

see also p. 553 


Exotic molecule tests 
fundamental symmetry 


Spectroscopy of exotic molecules 
can offer insight into fundamental 
physics. Hori et al. studied the 
transition frequencies of an 
unusual helium atom in which 


Clark et al., p. 606 


Superoxide generation 
visualized in a mouse aorta 


one of the two electrons was 
substituted by an antiproton, the 
negatively charged antiparticle 
partner of the proton (see the 
Perspective by Ubachs). The 
antiprotonic helium was cooled 
down to low temperatures to 
allow the frequencies to be mea- 
sured with high precision. The 
extracted mass of the antiproton 
(relative to the electron mass) 
was in good agreement with 
previous measurements of the 
proton mass. This finding is in 
keeping with the implications of 
the combined charge, parity, and 
time-reversal symmetry of physical 
laws. —JS 

Science, this issue p. 610; 

see also p. 546 


Assessing smoke damage 
in cancer genomes 


We have known for over 60 years 
that smoking tobacco is one of 
the most avoidable risk factors 
for cancer. Yet the detailed 
mechanisms by which tobacco 
smoke damages the genome 
and creates the mutations that 
ultimately cause cancer are still 
not fully understood. Alexandrov 
et al. examined mutational 
signatures and DNA methylation 
changes in over 5000 genome 
sequences from 17 different 
cancer types linked to smoking 
(see the Perspective by Pfeifer). 
They found a complex pattern 
of mutational signatures. Only 
cancers originating in tissues 
directly exposed to smoke 
showed a signature charac- 
teristic of the known tobacco 
carcinogen benzo[a]pyrene. One 
mysterious signature was shared 
by all smoking-associated 
cancers but is of unknown origin. 
Smoking had only a modest 
effect on DNA methylation. 
—PAK 


Science, this issue p. 618; 
see also p. 549 


Cigarette smoking produces a variety 
of mutational signatures. 




RESEARCH 


IN SCIENCE JOURNALS 


Rejuvenating viral 
vectors 


Adenoviral (Ad5) vaccine vec- 
tors elicit mixed responses: 
They induce protective CD8+ 
T cells, but these cells may 
be partially exhausted. Now 
Larocca et al. demonstrate that 
this exhausted phenotype may 
result from Ad5 vector-induced 
antigen-specific CD4+ T cells 
that express interleukin-10 
(IL-10) and PD-1 in both mice 
and macaques. These IL-10+ 
CD4+ T cells suppress the 
vaccine-induced CD8+ T cell 
response, and their inhibitory 
function may depend in part on 
IL-27. Targeting this inhibitory 
pathway may thus enhance 
protection of viral vector-based 
vaccines. —AC 

Sci. Immunol. 1, eaaf7643 (2016). 


Shrinking 
spectrometers 


Dual-comb spectroscopy is a 
powerful technique that uses 
the interference of two closely 
related combs to map spec- 
troscopic features directly into 
a frequency domain that can 
be read by electronics. Suh 
et al. developed a dual-comb 
spectroscopy approach using 
combs produced by silica 
microresonators fabricated 
on a silicon chip. Perhaps high-resolution 
spectroscopy will 
soon be shrunk to the chip 
scale, doing away with the need 
for bulky spectrometers. —ISO 
Science, this issue p. 600 


Uranium in 
the deep sea 


The ratio of ²³⁴U to ²³⁸U in 
seawater underlies modern 
marine uranium-thorium 
geochronology, but it is difficult 
to establish the ratio precisely. 
Chen et al. report two ²³⁴U/²³⁸U 
records derived from deep-sea 
corals (see the Perspective 

by Yokoyama and Esat). The 
records reveal a number of 


important similarities to and 
differences from existing 
records of the past 30,000 
years. Higher values during the 
most recent 10,000 years than 
during earlier glaciated condi- 
tions may reflect enhanced 
subglacial melting during 
deglaciation. —HJS 

Science, this issue p. 626; 

see also p. 550 


Spinal cord regeneration 
in zebrafish 


Unlike humans, zebrafish can 
regenerate their spinal cord. 
Mokalled et al. identified a 
growth factor in zebrafish that 
helps this process (see the 
Perspective by Williams and 
He). The protein encoded by 
ctgfa (connective tissue growth 
factor a) is secreted after injury 
and encourages glial cells to 
form a bridge across the spinal 
lesion. Addition of this protein 
improved spinal cord repair in 
injured zebrafish. —PJH 

Science, this issue p. 630; 

see also p. 544 


Soil microbes yield 
insecticidal peptide 
The microbial peptide BT, 
derived from the bacterium 
Bacillus thuringiensis, is widely 
used to protect crops from 
insect pests. Schellenberger 
et al. identified another 
insecticidal peptide from a dif- 
ferent soil-dwelling bacterium, 
Pseudomonas chlororaphis 
(see the Perspective by 
Tabashnik). Corn plants 
expressing the Pseudomonas 
peptide were protected from 
attack by western corn root- 
worm. Rootworms that were 
resistant to BT were suscep- 
tible to the Pseudomonas 
peptide. Addition of another 
insecticidal peptide diversi- 
fies the arsenal against insect 
pests, which may slow down 
the development of resistance. 
—PJH 

Science, this issue p. 634; 

see also p. 552 


Edited by Caroline Ash 
and Jesse Smith 


PUBLIC HEALTH 
Worms remodel 
immune responsiveness 

Rural populations in less developed countries 
commonly show poor immunogenicity in 
vaccination programs. Helminth infestations 
remain common in some rural areas, and 
cellular immune hyporesponsiveness is a 
hallmark of chronic helminth infections. Community 
deworming programs are in general believed to be 
a good thing to reverse the morbidity that a large 
worm burden can impose on children. Wammes et 
al. set up a 2-year clinical trial to systematically test 
the immunological consequences of deworming 
in >1000 villagers in Indonesia. After treatment, 
subjects showed significant immune remodeling, 
with reduced expression of CTLA-4 (cytotoxic T 
lymphocyte-associated antigen 4) and elevated 
proinflammatory cytokine responses to malaria 
parasite antigens. The challenge in the longer term 
could be that restored immune responsiveness 
might increase the prevalence of inflammatory 
diseases. —CA 


Proc. Nat. Acad. Sci. 10.1073/pnas.1604570113 (2016). 


Printing emulsions 
and foams in 3D 

The properties of porous 
materials can depend on the 
overall volume, size, and size 
distribution of the pores, but 
these can be difficult features to 
simultaneously control during 
fabrication. Emulsion or foam 
techniques can produce pores 
of only one narrow size range, 
but in combination with a direct 
ink-writing process, Minas et 
al. developed an approach to 
create porous materials with 
highly tunable properties. 
Shear-thinning inks made from 
modified alumina particles are 
progressively deposited to form 
an overall structure, where large 
pores are made by the writing 
process and small ones from the 
emulsion or foam, with pores 
either open or closed. A percolating 
network of load-bearing 
struts results in structures with 
very high strength-to-weight 
ratios. -MSL 
Adv. Mater. 10.1002/adma.201603390 
(2016). 

[Image caption: A printed cubic lattice scaffold for 
the creation of porous ceramics] 

Published by AAAS 

Cognition, behavior, 
and the globus pallidus 


Deep inside the brain, the 
external segment of the globus 
pallidus receives many inputs 
from the neighboring basal 
ganglia. Its role in basic cognitive 
functions has rarely been mea- 
sured directly. While monkeys 
were learning a complex task, 
Schechtman et al. recorded 
discharges from prototypical 
neurons in the structure. The 


sciencemag.org SCIENCE 


PHOTOS (FROM TOP): EYE OF SCIENCE/SCIENCE SOURCE; MINAS ET AL., MATERIAL VIEWS (10 AUGUST 2016) ©2016


PHOTO: SUSUMU NISHINAGA/SCIENCE SOURCE 


animals had to switch to a new 
stimulus-reward association once 
a predefined learning criterion 
had been reached. The activ- 
ity of the neurons predicted 
whether the monkeys would 
change or maintain their previous 
stimulus choice. Neuronal spike 
activity also encoded whether 
such choices were successful. 
When the animals’ performance 
became more automated and 
less demanding, discharge rates 
in the context of choice selection 
decreased. —PRS 

Proc. Nat. Acad. Sci. 10.1073/ 

pnas.1612392113 (2016). 


CELL BIOLOGY 
Microtubule repair 
and rescue 


The microtubule network that 
forms the cytoskeleton is 
essential for cell polarization and 
migration. Microtubules tend to 
grow slowly, and they can shrink 
rapidly in a process known as 
catastrophe. Aumeier et al. 
report that where the network is 




Ascaris lumbricoides 
is a common
parasite of humans. 


dense, microtubules bend and 
cross over, leading to increased 
mechanical stress and filament 
damage. Damaged filaments 
can self-repair in a process 
that also prevents complete 
depolymerization. Most of the 
tubulin incorporates into the 
growing ends of microtubules, 
but some incorporates along 
the shaft of preexisting microtu- 
bules at damaged regions. This 
repair-and-rescue mechanism 
provides insight into how the 
network can remain stable in 
this highly dynamic system. — 
SH and SMH 


Nat. Cell Biol. 18, 1054 (2016). 


CHEMISTRY 
Sourcing hydrogen
directly from wax


Hydrogen is an appealing fuel 
because of the energy released 
in its reaction with oxygen and 
the nearly ideal safety profile 

of the sole product, water. The 
challenge lies in transporting 
and storing the hydrogen before 


its use. Gonzalez-Cortes et al. 
now show that simple paraffin 
wax can be an efficient source 
of hydrogen under the right 
circumstances. Specifically, they 
embedded carbon-supported 
ruthenium nanoparticles in the 
wax and then irradiated it with 
microwaves, thereby releasing up 
to half of the available gravimetric 
hydrogen content. The authors 
attribute the catalytic efficiency 
to a possible combination of local 
superheating and field-induced 
plasma formation. —JSY 

Sci. Rep. 10.1038/srep35315 (2016). 


EDUCATION 
What a (scientific) 
argument is not 


Engaging in arguments based 
on evidence is a practice found 
in STEM (science, technology,
engineering, and mathematics) 
education frameworks. Are 
students being taught to argue 
evidence in the same way that 
professional scientists do? 
MacPherson interviewed 10 
ecologists about arguments both 
current and ongoing in their field. 
These data were compared to 
ecological arguments portrayed 
in assessment tasks written for 
middle and high school science 
students. Ecologists discussed 
causal claims, whereas school 
science tasks instead focused 
on descriptive and prescriptive 
claims. This mismatch results in 
students not receiving an accu- 
rate picture of how professional 


ecologists argue. How can the 
STEM education community 
move forward with designing 
tasks that ask students to cri- 
tique evidence in a truly scientific 
way? —MM 

Sci. Ed. 10.1002/sce.21246 (2016). 


NEURODEVELOPMENT 
Layered haircut 
underlies hearing 


Hair cells deep within the ear 
transduce sound into hearing. 
On any single hair cell, a pack 
of stereocilia is neatly arranged 
from tallest to shortest. When 
the stereocilia do not develop 
adequately, deafness ensues. 
Studying mice, Tarchini et al. 
discovered some of the key sig- 
naling components that organize 
stereocilia during development. 
Two regulators of G proteins, 
the leucine-glycine-asparagine 
repeat protein and an inhibitory
α-subunit of heterotrimeric
G protein, coordinate to define 
the tallest row of stereocilia. 
Both regulators are expressed 
in the bare zone of the hair cell, 
a surface domain that will not 
produce stereocilia, and also in 
the very tips of the row of stereo- 
cilia adjacent to the bare zone. 
These stereocilia will emerge as 
the tallest of the crowd. If this 
signaling pathway is disrupted, 
stereocilia develop to more even 
and modest heights, and the 
animal is deaf. —PJH 

Development 10.1242/dev.139089 

(2016). 
SCIENCE COMMUNITY
Scientific impact— 
that is the Q 


Are there quantifiable patterns 
behind a successful scientific 
career? Sinatra et al. analyzed 
the publications of 2887 
physicists, as well as data on 
scientists publishing in a variety 
of fields. When productivity 
(which is usually greatest early 
in the scientist's professional 
life) is accounted for, the paper 
with the greatest impact occurs 
randomly in a scientist's career. 
However, the process of generat- 
ing a high-impact paper is not 
an entirely random one. The 
authors developed a quantita- 
tive model of impact, based 
on an element of randomness, 
productivity, and a factor Q that 
is particular to each scientist 
and remains constant during the 
scientist's career. —BJ 

Science, this issue p. 596 


SYSTEMS BIOLOGY 
Complex transcription 
factor interactions 


To respond to environmental 
changes, such as drought, plants 
must regulate numerous cellular 
processes. Working in the model 
plant Arabidopsis, Song et al. 
profiled the binding of 21 tran- 
scription factors to chromatin 
and mapped the complex gene 
regulatory networks involved 
in the response to the plant 
hormone abscisic acid. The work 
provides a framework for under- 
standing and modulating plant 
responses to stress. —LMZ 
Science, this issue p. 598 
YEAST GENETICS
A global genetic 
suppression network 


The genetic background of an 
organism can influence the 
overall effects of new genetic 
variants. Some mutations can 
amplify a deleterious phenotype, 
whereas others can suppress it. 
Starting with a literature survey 
and expanding into a genome- 
wide assay, van Leeuwen et al. 
generated a large-scale sup- 
pression network in yeast. The 
data set reveals a set of general 
properties that can be used to 
predict suppression interactions. 
Furthermore, the study provides 
a template for extending sup- 
pression studies to other genes 
or to more complex organisms. 
—LMZ 


Science, this issue p. 599 


OPTICAL PROCESSING 
Taking the pulse 
of optimization 


Finding the optimum solution 
of multiparameter or multifunc- 
tional problems is important 
across many disciplines, but 
it can be computationally 
intensive. Many such problems 
defined as computationally 
difficult can be mathematically 
mapped onto the so-called 
Ising problem, which looks at 
finding the minimum energy 
configuration for an array of 
coupled spins. Inagaki et al. and 
McMahon et al. show that an 
optical processing approach 
based on a network of coupled 
optical pulses in a ring fiber can 
be used to model and optimize 
large-scale Ising systems. Such 
a scalable architecture could 
help to optimize solutions to a 
wide range of complex problems. 
—ISO

Science, this issue pp. 603 and 614 
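
As a small illustration of what "finding the minimum energy configuration for an array of coupled spins" means, the sketch below enumerates every spin configuration of a tiny system and keeps the lowest-energy one. It is only a toy: the sign convention E = -sum J_ij s_i s_j, the three-spin coupling table, and the function names are assumptions made for this example, and exhaustive search is not the optical approach described in the item above.

```python
from itertools import product


def ising_energy(spins, couplings):
    """Energy E = -sum over pairs of J_ij * s_i * s_j, with spins in {-1, +1}.

    The sign convention is an assumption for this sketch.
    """
    return -sum(j * spins[a] * spins[b] for (a, b), j in couplings.items())


def brute_force_ground_state(n_spins, couplings):
    """Try all 2**n_spins configurations and return one with minimal energy."""
    return min(
        product((-1, 1), repeat=n_spins),
        key=lambda s: ising_energy(s, couplings),
    )


# Hypothetical example: three spins on a triangle with antiferromagnetic couplings.
couplings = {(0, 1): -1.0, (1, 2): -1.0, (0, 2): -1.0}
best = brute_force_ground_state(3, couplings)
print(best, ising_energy(best, couplings))
```

The number of configurations doubles with every added spin, which is why exhaustive search stops being practical for large systems.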


4 NOVEMBER 2016 * VOL 354 ISSUE 6312 


AUTOIMMUNITY 


TYK2’s balancing act 
Determining the biological con- 
sequences of the thousands of 
genetic variants that contribute 
to common diseases is chal- 
lenging. Genetic variants that 
influence autoimmune diseases 
have been identified in the gene 
encoding TYK2 (tyrosine kinase 
2), but conflicting evidence 
regarding their biological impact 
obscures the therapeutic poten- 
tial of TYK2. In resolving 

this conflict, Dendrou et al. have 
revealed a genetic effect that 
drives an optimal degree of 
immune signaling: low enough to 
be protective against autoimmu- 
nity but high enough to prevent 
immunodeficiency. These 
findings indicate that TYK2 may 
be a potential drug target in a
number of autoimmune condi- 
tions. -OMS 


Sci. Transl. Med. 8, 363ra149 (2016).


CONSERVATION 
Can Europe’s migratory
birds be saved?


Many migratory birds breed in 
Europe and overwinter in sub- 
Saharan Africa. In a Perspective, 
Bairlein outlines several factors 
that are causing their popula- 
tions to decline. Illegal takings 
and killings reduce populations, 
particularly during migra- 
tion. Climate change causes 
ecological mismatches between 
birds and their insect prey. But, 
perhaps most importantly, 
human land-use changes are 
causing habitat degradation and 
loss in the birds’ breeding and 
winter habitats, as well as along 
their migration routes. Political 
agreements and action plans 
are already in place but must be 
implemented to halt the popula- 
tion declines and avoid species 
extinctions. —JFU 

Science, this issue p. 547 




QUANTUM GASES 
Shaking the lattice 
uncovers universality 


Most of our knowledge of 
quantum phase transitions 
(QPTs)—which occur as a 
result of quantum, rather than 
thermal, fluctuations—comes 
from experiments performed in 
equilibrium conditions. Less is 
known about the dynamics of 

a system going through a QPT, 
which have been hypothesized 
to depend on a single time

and length scale. Clark et al. 
confirmed this hypothesis in a 
gas of cesium atoms in an opti- 
cal lattice, which was shaken 
progressively faster to drive the 
gas through a QPT. —JS 


Science, this issue p. 606 
Quantifying the evolution
of individual scientific impact 


Roberta Sinatra, Dashun Wang, Pierre Deville, 


Chaoming Song, Albert-László Barabási*


INTRODUCTION: In most areas of human 
performance, from sport to engineering, the 
path to a major accomplishment requires a 
steep learning curve and long practice. Science 
is not that different: Outstanding discoveries 
are often preceded by publications of less mem- 
orable impact. However, despite the increas- 
ing desire to identify early promising scientists, 
the temporal career patterns that character- 
ize the emergence of scientific excellence remain 
unknown. 
RATIONALE: How do impact and productivity
change over a scientific career? Does impact,
arguably the most relevant performance
measure, follow predictable patterns? Can
we predict the timing of a scientist’s out- 
standing achievement? Can we model, in 
quantitative and predictive terms, scientific 
careers? Driven by these questions, here we 
quantify the evolution of impact and pro- 
ductivity throughout thousands of scientific 
careers. We do so by reconstructing the publication
record of scientists from seven disciplines,
associating to each paper its long-term
impact on the scientific community, as quantified
by citation metrics.


[Figure: publication histories of Frank A. Wilczek (Physics Nobel, 2004) and John B. Fenn (Chemistry Nobel, 2002).]

Random-impact rule. The publication history of two Nobel laureates, Frank A. Wilczek (Nobel
Prize in Physics, 2004) and John B. Fenn (Nobel Prize in Chemistry, 2002), illustrating that the
highest-impact work can be, with the same probability, anywhere in the sequence of papers
published by a scientist. Each vertical line corresponds to a research paper. The height of each line
corresponds to paper impact, quantified with the number of citations the paper received after 10
years. Wilczek won the Nobel Prize for the very first paper he published, whereas Fenn published
his Nobel-awarded work late in his career, after he was forcefully retired by Yale. [Image of Frank A.
Wilczek is reprinted with permission of STS/Society for Science & the Public. Image of John B.
Fenn is available for public domain use on Wikipedia.org.]


RESULTS: We find that the highest-impact 
work in a scientist’s career is randomly dis- 
tributed within her body of work. That is, the 
highest-impact work can be, with the same 
probability, anywhere in the sequence of papers 
published by a scientist—it could be the first 
publication, could appear mid-career, or could 
be a scientist’s last publication.
This random-impact rule holds for scientists in
different disciplines, with different career lengths,
working in different decades, and publishing solo
or with teams and whether credit is assigned
uniformly or unevenly among collaborators.
The random-impact rule allows us to develop
a quantitative model, which systematically
untangles the role of productivity and luck
in each scientific career. The model assumes
that each scientist selects a project with a
random potential $p$ and improves on it with
a factor $Q_i$, resulting in a publication of impact
$Q_i p$. The parameter $Q_i$ captures the
ability of scientist $i$ to take advantage of the
available knowledge in a way that enhances
($Q_i > 1$) or diminishes ($Q_i < 1$) the potential
impact $p$ of a paper. The model predicts that
truly high-impact discoveries require a combination
of high $Q$ and luck ($p$) and that increased
productivity alone cannot substantially enhance
the chance of a very high impact work. We
also show that a scientist’s $Q$, capturing her
sustained ability to publish high-impact papers,
is independent of her career stage. This is in
contrast with all current metrics of excellence,
from the total number of citations to the h-index,
which increase with time. The $Q$ model
provides an analytical expression of these traditional
impact metrics and allows us to
predict their future time evolution for each
individual scientist, being also predictive of
independent recognitions, like Nobel prizes.

Read the full article at http://dx.doi.org/10.1126/science.aaf5239


CONCLUSION: The random-impact rule 
and the Q parameter, representing two fun- 
damental characteristics of a scientific career, 
offer a rigorous quantitative framework to ex- 
plore the evolution of individual careers and 
understand the emergence of scientific excel- 
lence. Such understanding could help us better 
gauge scientific performance and offers a path 
toward nurturing high-impact scientists, po- 
tentially informing future policy decisions. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: alb@neu.edu 
Quantifying the evolution
of individual scientific impact 


Roberta Sinatra, Dashun Wang, Pierre Deville,
Chaoming Song, Albert-László Barabási*


Despite the frequent use of numerous quantitative indicators to gauge the professional 
impact of a scientist, little is known about how scientific impact emerges and evolves in 
time. Here, we quantify the changes in impact and productivity throughout a career in 
science, finding that impact, as measured by influential publications, is distributed 
randomly within a scientist’s sequence of publications. This random-impact rule allows us 
to formulate a stochastic model that uncouples the effects of productivity, individual 
ability, and luck and unveils the existence of universal patterns governing the emergence of 
scientific success. The model assigns a unique individual parameter Q to each scientist, which 
is stable during a career, and it accurately predicts the evolution of a scientist’s impact, from 
the h-index to cumulative citations, and independent recognitions, such as prizes. 


Productivity, representing the number of
publications authored by a scientist over 
time, and impact, often approximated by 
the number of citations a publication receives
(1-4), are frequently used metrics
to gauge a scientist’s performance. Despite their 
widespread use, we lack a quantitative understand- 
ing of the patterns these metrics follow during 
a scientist’s career (5). This is particularly alarm- 
ing (6-11), given that they are increasingly adopted 
for academic assessment (4, 17) and serve as 
the input for numerous indicators, like the h- 
index and its variants, which are frequently 
used to compare individual performance (2-14). 
Given the increasing interest in predicting the 
value of these indicators (5, 15), here we ask: 
How do impact and productivity change over 
a typical scientific career? Does impact, arguably 
the most relevant performance measure, follow 
predictable patterns? Can we predict the timing 
of a scientist’s outstanding achievement? Can we 
untangle the role of impact, productivity, and luck 
within a scientific career? 
To address these questions, we reconstruct the 
publication profile of scientists from multiple 
disciplines and associate each of their publications
with an impact, as captured by $c_{10}$, the
number of citations 10 years after publication
(Fig. 1A; see Methods and section S1).
Motivated partly by the exceptional aware- 
ness of a scientist’s highest-impact work (6, 17), 
like radioactivity for Marie Curie and the double 
helix for Watson and Crick, we identified for 
each researcher her most-cited paper, $c^*_{10}$, that is,
the paper with the highest number of citations
10 years after its publication. The distribution
$P(c^*_{10})$ for the studied scientists indicates that
only 5% have $c^*_{10} \geq 200$; hence, most scientific
careers have limited maximal impact. To systematically
distinguish the careers on the basis of
their peak impact, we group each scientist into
high maximum impact (top 5%, $c^*_{10} \geq 200$),
low maximum impact (bottom 20%, $c^*_{10} < 20$),
and medium maximum impact (middle 75%,
$20 < c^*_{10} < 200$) categories (Fig. 1B and section S2).


Productivity and impact patterns 
in scientific careers 


The total number of papers scientist $i$ publishes
up to time $t$ after her first publication, $N_i(t)$,
asymptotically follows $N_i(t) \sim t^{\gamma_i}$ (Fig. 1C) (18).
Hence, yearly productivity, $n_i(t)$, follows the
same scaling with exponent $(\gamma_i - 1)$ (fig. S5).
Yet, the scaling exponent is different for low-,
medium-, and high-impact scientists (Fig. 1C).
We find that for low-impact scientists, $\langle \gamma \rangle =
1.55$, indicating on average a steady increase
in their productivity. The increase is much
faster for high-impact researchers, for whom
$\langle \gamma \rangle = 2.05$ (Fig. 1D). These trends are also confirmed
by changes in the yearly productivity
$\langle n(t) \rangle$: For high-impact scientists, productivity
increases almost threefold during their career, 
whereas the increase is modest for low-impact 
scientists (Fig. 1E). Together, Fig. 1 (D and E) 
indicates that productivity changes through- 
out a scientific career. We find, however, that 
this trend is modulated by impact: Productivity
growth is more pronounced for high-impact scientists
and is modest for low-impact scientists
(Fig. 1, C to E).
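
To make the productivity scaling concrete, the exponent of a power law N(t) ~ t^gamma can be estimated for one career as the slope of a straight-line fit in log-log space. The sketch below is illustrative only: the helper name `fit_productivity_exponent` and the synthetic career are assumptions, and the text does not state that this is the fitting procedure the authors used.

```python
import math


def fit_productivity_exponent(years, cumulative_papers):
    """Least-squares slope of log N(t) versus log t (hypothetical helper).

    `years` are positive times since the first publication and
    `cumulative_papers` the matching cumulative paper counts.
    """
    xs = [math.log(t) for t in years]
    ys = [math.log(n) for n in cumulative_papers]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic career (an assumption): cumulative output growing exactly as t**2.
years = list(range(1, 21))
papers = [0.5 * t ** 2 for t in years]
gamma = fit_productivity_exponent(years, papers)
print(round(gamma, 2))
```

With real careers the early years are noisy, so the fit would normally be restricted to the asymptotic part of the curve.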

As Fig. 2A indicates, impact appears to follow
similar patterns to productivity (Fig. 1E): Although
$\langle c_{10} \rangle$ increases during a high-impact scientist’s
career, an increase is hardly noticeable for average
and low-impact individuals. Yet, we observe
a markedly different pattern if we examine
the impact in the vicinity of $t^*$, the publication
time of the most-cited work $c^*_{10}$. Plotting $\langle c_{10} \rangle$ for
the sequence of papers before and after an individual’s
most-cited paper, $c^*_{10}$ (Fig. 2B), we do
not see a gradual increase in impact as a scientist
approaches $t^*$, nor do we observe elevated
citations after this breakthrough. Instead, the
observed pattern exhibits a singular behavior.
This singularity could be a simple result of
averaging random-impact fluctuations present
in careers. We find, however, that the result is
robust if we use a moving average or consider
only the publication with maximum impact in a
rolling window (section S2.1 and fig. S6) and is
validated using the fitting hypothesis $c_{10}(t) \sim \alpha t$,
lacking differences in $\alpha$ before or after $t^*$ (section
S2.2 and fig. S7) (29). Also, the papers published
before and after $t^*$ show no discernible differences
in their average number of citations (Fig. 2C).
Finally, we randomize each career by leaving all
productivity measures [total number of papers,
$N_i$, and $n_i(t)$] unchanged but shuffling the impact
of each paper within each career (Fig. 2C). The 
lack of differences between the original and 
the randomized careers supports our overall con- 
clusion: There are no detectable changes in im- 
pact leading up to or following a scientist's 
highest-impact work. We tested the robustness 
of this measure for different samples of scientists 
(figs. S8 and S9), for different definitions of im- 
pact (section S1.6 and fig. S10), and in data sets 
where we attribute different impact shares to 
each author of a paper (section S6 and fig. S11), 
arriving at the same conclusion. Yet, we can- 
not exclude that there are other choices of im- 
pact variables or data-set selection that can detect 
patterns before or after the highest-impact paper. 

To understand when a scientist publishes her
most important work, we measured the probability
$P(t^*)$ that the highest-impact paper is published
at time $t^*$ after a scientist’s first publication
(Fig. 2D). The high $P(t^*)$ between 0 and 20 years
indicates that most scientists publish their highest-impact
paper early or midcareer. The drop in
$P(t^*)$ after 20 years suggests that it is unlikely
that a scientist’s most-cited work will come late
in her career, a result well documented by the
literature about creativity (see section S3.1) (20, 27).
To understand the origin of this pattern, we
shuffled $c_{10}$ among all papers published by the
same scientist, preserving the scientist’s time-dependent
productivity and paper-by-paper impact
and randomizing only the order of her
publications. The fact that $P(t^*)$ for these synthetic
careers is indistinguishable from the original data
(Fig. 2D) indicates that variations in $P(t^*)$ are not
due to specific impact sequences or other features
but are entirely explained by year-by-year variations
in productivity throughout a career (fig. S12) (20, 27).

Fig. 1. Patterns of productivity during a scientific career.
(A) Publication history of Kenneth G. Wilson (Nobel Prize in Physics, 1982).
Horizontal axis indicates the number of years after the scientist's first
publication, and each vertical line corresponds to a research paper. The
height of each line corresponds to $c_{10}$, that is, the number of citations
the paper received after 10 years (sections S13 and S16). The highest-impact
paper of Wilson was published in 1974, 9 years after his first publication,
and it is the 17th of his 48 papers; hence, $t^* = 9$, $N^* = 17$, and $N = 48$.
(B) Distribution of the highest-impact paper $P(c^*_{10})$ across all scientists.
We highlight in blue the bottom 20% of the area, corresponding to low maximum
impact scientists ($c^*_{10} < 20$); red area indicates the high maximum impact
scientists (top 5%, $c^*_{10} \geq 200$); yellow corresponds to the remaining 75%
medium maximum impact scientists ($20 < c^*_{10} < 200$). These cutoffs do not
change if we exclude review papers from our analysis (see figs. S4 and S36).
(C) Number of papers $N(t)$ published up to time $t$

These results prompted us to explore the po- 
sition N* of the highest-impact paper in the 
sequence of N publications of a scientist by 
measuring $P(N^*/N)$, that is, the probability that
the most-cited work is early ($N^*/N$ small) or late
($N^*/N \approx 1$) within the sequence of papers published
by a scientist. We find that $P(N^*/N)$ is flat
(Fig. 2E, inset), a finding supported by the cumulative
$P(\geq N^*/N)$ (Fig. 2E), which decreases
linearly with $N^*/N$, independently of impact, in line
with a uniform $P(N^*/N)$. Together, we arrive
at a rather unexpected conclusion, representing 
our main empirical finding: Impact is randomly 
distributed within a scientist’s body of work, 
regardless of publication time or order in the 
sequence of publications. We call this the 
random-impact rule because it indicates that 
the highest-impact work can be, with the same 
probability, anywhere in the sequence of N papers 
published by a scientist. We find that the random- 
impact rule holds for scientists in different dis- 
ciplines, with different career lengths, working in 
different decades, and publishing solo or with 
teams and whether credit is assigned uniformly 
or unevenly among collaborators (sections S14 
and S6.1) (22). 
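
The shuffling test behind the random-impact rule can be sketched on synthetic data: compute the relative position N*/N of the top paper in each career, then repeat after randomly permuting each career's impacts. Everything below is an assumption made for illustration (the log-normal parameters, the career lengths, and the function names); it is not the authors' data or code. Because the synthetic impacts are drawn independently, observed and shuffled careers agree by construction, which is the behavior the rule describes for real careers.

```python
import random


def relative_position_of_best(impacts):
    """Return N*/N: 1-based position of the highest-impact paper over career length."""
    best_index = max(range(len(impacts)), key=lambda k: impacts[k])
    return (best_index + 1) / len(impacts)


def shuffled_career(impacts, rng):
    """Null model: the same papers in a random order."""
    permuted = list(impacts)
    rng.shuffle(permuted)
    return permuted


rng = random.Random(0)
# Synthetic careers (assumed parameters): 20 to 100 papers, log-normal impacts.
careers = [
    [rng.lognormvariate(1.85, 1.0) for _ in range(rng.randint(20, 100))]
    for _ in range(2000)
]
observed = [relative_position_of_best(c) for c in careers]
null = [relative_position_of_best(shuffled_career(c, rng)) for c in careers]
mean_observed = sum(observed) / len(observed)
mean_null = sum(null) / len(null)
print(round(mean_observed, 2), round(mean_null, 2))
```

A uniform N*/N has a mean close to one half, so both averages should sit near 0.5; a systematic shift toward early or late papers would show up as a departure from that value.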

The random-impact rule can explain the grow- 
ing impact during a scientist’s career (Fig. 2A). 
To see this, we again randomly shuffle the im- 
pact of the papers within each career, leaving the 


individual productivity unchanged. The variations
of impact of the randomized careers are indistinguishable
from the original data for both high-
and low-impact individuals (Fig. 2A). Hence, the 
growing average impact documented in Fig. 2A is 
the result of combining the increasing average 
productivity (Fig. 1E) with the heavy-tailed nature 
of the citation distribution (6, 23-25). $\langle c_{10} \rangle$ is not
stable but increases with the number of publica- 
tions, resulting in the observed growing impact 
(Fig. 2A). Hence, growing productivity, rather 
than increasing ability or excellence, can account 
for the growth in average impact during a career 
in science. 

The defining role of productivity in the timing 
of the highest-impact work persists if we extend 
the analysis to different samples of scientists, 
not only those with at least 20 years of pub- 
lication record. We considered different selec- 
tions of scientists, such as (i) grouping them 
by different career lengths (figs. S13 and 
S14), (ii) grouping them by decade of active
career (figs. S15 and S16), (iii) removing multi- 
authored papers (fig. S17), (iv) including only 
papers published in one subarea of physics 
(fig. S38), (v) creating no filter and including 
all scientists (figs. S18 and S19), (vi) using 
different definitions of impact (figs. S20, S21, 
and S37), or (vii) considering the six different 
disciplines in data set (ii) (figs. S22 and S23). In
all these cases, the location of the peak of the 
highest-impact work probability changes, but 
we never observe a difference with the random- 
ized careers. Hence, the specific shape of P(t*) is 
only a function of the selection of scientists and 


for three scientists with low, medium, and high impact but with comparable final
number of papers throughout their career. (D) Distribution of the productivity
exponents $\gamma$ (18). The productivity of high-impact scientists grows faster than does
that of low-impact scientists. (E) Dynamics of productivity, as captured by the
average number of papers $\langle n(t) \rangle$ published each year for high-, average-, and
low-impact scientists. $t = 0$ corresponds to the year of a scientist's first publication.


of their temporal productivity patterns, whereas 
impact is always randomly distributed within a 
scientist’s sequence of publications. 

The documented random-impact rule raises 
an important question: What is the role of a 
researcher’s own ability, if any, in scientific ex- 
cellence? We propose two quantitative models 
to answer this question. 


Random-impact model (R-model) 


We can rely on the random impact rule to build 
a null model of scientific careers: We assume 
that each scientist publishes a sequence of pa- 
pers whose impact is randomly chosen from the 
same impact distribution $P(c_{10})$. Consequently,
the only difference between two scientists is their
overall productivity $N$. With the observed $P(c_{10})$
and P(N) distributions (Fig. 3, A and B) as input, 
the obtained R-model (section S4.2) accurately 
reproduces the randomness of the impact se- 
quence P(N*/N) (Fig. 2E), but it also makes 
two predictions that are at odds with the data. 

(a) Productivity alone begets success: If
each paper’s impact is randomly drawn from
the same $P(c_{10})$, a productive scientist (high $N$)
will more likely score a high $c^*_{10}$ (see eqs. S7 and
S18) (26, 27). However, the R-model does not
correctly reproduce the observed increase of $\langle c^*_{10} \rangle$
as a function of $N$ (Fig. 3C).

(b) Divergent impact: The higher the average
impact of a scientist’s publications without
the most-cited publication, $\langle c^{-*}_{10} \rangle$ (Fig. 1A), the
higher the impact of the most-cited paper, $c^*_{10}$
(Fig. 3D). Hence, papers with truly high impact
are published by scientists with a consistent record
of high impact. The R-model cannot account
for this behavior, predicting that $\langle c^*_{10} \rangle$ diverges
when $\langle \log c^{-*}_{10} \rangle \to 1.97$ (Fig. 3D), a consequence
of the log-normal nature of $P(c_{10})$ (section S4.1
and fig. S27).

Fig. 2. Patterns of [...] published before and after the highest-impact paper $c^*_{10}$ of
high-, middle-, and low-impact scientists. The plot indicates that there are no
discernible changes in impact before or after a scientist's highest-impact work.
(C) $\langle c^*_{10} \rangle$ and $\langle c_{10} \rangle$ before and after a scientist's most-cited paper. For each
group, we calculate the average impact of the most-cited paper, $\langle c^*_{10} \rangle$, as well
as the average impact of all papers before and after the most-cited paper. We
also report the same measures obtained in publication sequences for which
the impact $c^*_{10}$ is fixed, whereas the impact of all other papers is randomly
permuted. (D) Distribution of the publication time $t^*$ of the highest-impact
paper for scientists’ careers (black circles) and for randomized-impact careers
(gray circles). The lack of differences between the two curves (P = 0.70 for the
Mann-Whitney U test between the two distributions) supports the random-impact
rule; that is, impact is random within a scientist's sequence of publication.
Note that the drop after 20 years is partly because we focus on
careers that span at least 20 years (see fig. S22). (E) Cumulative distribution
$P(\geq N^*/N)$ for scientists with N ~ 50, where $N^*/N$ denotes the order $N^*$ of the

Failures (a) and (b) prompt us to abandon our 
hypothesis that research papers are all drawn 
from the same impact distribution and hence 
researchers have no distinguishable individual 
impact, forcing us to explore more closely the rela- 
tionship between productivity, impact, and chance. 
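
Prediction (a) of the R-model is easy to see in a toy simulation: if every paper's impact is drawn from one shared heavy-tailed distribution, the expected impact of a scientist's best paper rises with the number of papers alone. The sketch below assumes a log-normal impact distribution with arbitrary illustrative parameters (`mu=1.85`, `sigma=1.0`) and hypothetical function names; it is not fitted to the article's data.

```python
import random


def r_model_career(n_papers, rng, mu=1.85, sigma=1.0):
    """Draw n_papers impacts from one shared log-normal (assumed parameters)."""
    return [rng.lognormvariate(mu, sigma) for _ in range(n_papers)]


def mean_top_impact(n_papers, n_careers, rng):
    """Average, over simulated careers, of the highest impact in each career."""
    total = 0.0
    for _ in range(n_careers):
        total += max(r_model_career(n_papers, rng))
    return total / n_careers


rng = random.Random(1)
for n in (10, 100, 1000):
    # In this null model the average best-paper impact grows with n.
    print(n, round(mean_top_impact(n, 500, rng), 1))
```

In this null model productivity is the only lever, which is the property the text reports as being at odds with the observed dependence on N.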


Q-model 


Crucially, in the R-model, scientists with similar 
productivity have indistinguishable impact. In 
reality, impact varies greatly between scientists 
(Fig. 3E), suggesting the existence of a hidden 
parameter $Q_i$ that modulates impact, which has
a unique value for each scientist $i$.

The log-normal nature of $P(c_{10})$ (Fig. 3A) (24)
indicates the presence of multiplicative processes,
prompting us to write the impact $c_{10,i\alpha}$ of paper $\alpha$
published by scientist $i$ as


$$c_{10,i\alpha} = Q_i \, p_\alpha \qquad (1)$$


where $p_\alpha$ is the potential impact of paper $\alpha$ in
the sequence of papers published by scientist $i$.
The parameter $Q_i$ captures the ability of scientist $i$
to take advantage of the available knowledge in a
way that enhances ($Q_i > 1$) or diminishes ($Q_i < 1$)
the potential impact of paper $\alpha$. We take the value
of this parameter $Q_i$ to be constant throughout a
scientist’s career, a hypothesis we validate later
(Fig. 5 and section S4.9). The obtained model assumes
that each scientist randomly selects a
project with potential $p_\alpha$ and improves on it with
a factor $Q_i$ that is unique to the scientist, resulting
in a paper of impact (Eq. 1). Truly high-impact
publications are therefore the result of a high
$Q_i$ scientist selecting by chance a high $p_\alpha$ project;
any scientist, independently of her parameter $Q_i$,
can publish low-impact papers by selecting a
low $p_\alpha$.

The stochastic process behind the model (Eq. 1) is determined by the
joint probability P(p, Q, N), with unknown correlations between p, Q,
and N. The log-normal nature of $P(c_{10})$ (Fig. 3A) allows us to
measure P(p), finding that it can also be fitted with a log-normal
function (Fig. 3F). Assuming that P(Q) is also log-normal (confirmed
later), we denote $\hat{p} = \log p$, $\hat{Q} = \log Q$, and
$\hat{N} = \log N$, obtaining the trivariate normal distribution
$P(\hat{p}, \hat{Q}, \hat{N}) = \mathcal{N}(\boldsymbol{\mu}, \boldsymbol{\Sigma})$.
Using a maximum-likelihood approach (see section S4.4), we estimate
from the data the
highest-impact paper in a scientist’s career, varying between 1/N and 1. The
cumulative distribution of N*/N is a straight line with slope 1, indicating
that N* has the same probability to occur anywhere in the sequence of papers
published by a scientist. The flatness of P(N*/N) (all scientists, inset) sup- 
ports the conclusion that the timing of the highest-impact paper is uniform. 
The small differences between the three curves are due to different number 
of publications N in the three groups of scientists [see fig. S24 for the plot 
of P(=N*/N) for other values of N and figs. S25 and S26 for the impact 
autocorrelation throughout a scientific career]. 


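mean $\boldsymbol{\mu} = (\mu_p, \mu_Q, \mu_N) = (0.92, 0.93, 3.34)$ and
the covariance matrix


$$
\boldsymbol{\Sigma} =
\begin{pmatrix}
\sigma_p^2 & \sigma_{p,Q} & \sigma_{p,N} \\
\sigma_{p,Q} & \sigma_Q^2 & \sigma_{Q,N} \\
\sigma_{p,N} & \sigma_{Q,N} & \sigma_N^2
\end{pmatrix}
=
\begin{pmatrix}
0.93 & 0.00 & 0.00 \\
0.00 & 0.21 & 0.09 \\
0.00 & 0.09 & 0.33
\end{pmatrix}
\qquad (2)
$$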


The matrix (Eq. 2) leads to two key predictions:

(i) $\sigma_{p,N} = \sigma_{p,Q} = 0$ indicates that the paper
potential impact $p_{\alpha}$ is independent of a scientist’s
productivity $N_i$ and her hidden parameter $Q_i$. Therefore,
scientists select the potential impact of each paper randomly from a
P(p) distribution that is the same for all individuals, being
independent of Q and N, capturing a universal (that is,
scientist-independent) luck component behind impact.

(ii) The nonzero $\sigma_{Q,N}$ indicates that the hidden parameter Q
and productivity N do depend on each other (section S4.4), but its
small value also shows that high Q is only slightly associated with
higher productivity.

The lack of correlations between p and (Q, N) allows us to
analytically calculate the dependence of the highest-impact paper
$\langle \log c_{10}^{*} \rangle$ on productivity
Fig. 3. The Q-model. (A) Distribution of the paper impact $c_{10}$
across all publications in the data set. The gray line corresponds to
a log-normal function with average μ = 1.93 and SD σ = 1.05
(R² = 0.98). (B) Distribution of the total number of papers published
by a scientist (productivity). The gray line is a log-normal with
μ = 3.6 and σ = 0.57 [weighted Kolmogorov-Smirnov (KS) test,
P = 0.70]. (C) Citations of the highest-impact paper, $c_{10}^{*}$,
versus the number of publications N during a scientist's career. Each
gray point of the scatterplot corresponds to a scientist. The circles
are the logarithmic binning of the scattered data. The cyan curve
represents the prediction of the R-model, assuming that the impact of
each paper is extracted randomly from the distribution $P(c_{10})$ of
Fig. 3A. The red curve corresponds to the analytical prediction (see
eq. S35) of the Q-model (R² = 0.97; see section S4.6 and fig. S29 for
goodness of the fit). (D) $\log c_{10}^{*}$ versus
$\langle \log c_{10}^{-*} \rangle$. Each gray point in the scatterplot
corresponds to a scientist, where $\langle \log c_{10}^{-*} \rangle$
is the average logarithm of her paper impact, excluding the most-cited
paper $c_{10}^{*}$. We report in cyan the R-model prediction and in
red the analytical prediction (see eq. S36) of the Q-model (R² = 0.99;
see section S4.6 and fig. S29 for goodness of the fit). (E) Cumulative
impact distribution of all papers published by three scientists with
the same productivity, N ≈ 100, but different Q. (F) Distribution
$P(\hat{p})$ across all publications. For each paper $\alpha$ of
scientist $i$, we have
$\log p_{\alpha} = \log c_{10,i\alpha} - \log Q_i$, where
$\log Q_i = \langle \log c_{10,i} \rangle - \mu_p$. Therefore, the
distribution of $\hat{p} = \log p$, except for a common translational
factor $\mu_p$, corresponds to the distribution of
$\log c_{10,i\alpha} - \langle \log c_{10,i} \rangle$, which is a
normal with μ = 0 and σ = 0.95 (KS test, P = 0.48). (G) Distribution
of parameter Q, P(Q), for all scientists. The gray line corresponds to
a log-normal function with μ = 0.93 and σ = 0.46 (weighted KS test,
P = 0.59). (H) Cumulative distribution of the rescaled impact
$c_{10,i\alpha}/Q_i$ for the three scientists in (E). The black line
corresponds to the universal distribution P(p). The collapse is
predicted by Eq. 1.


N (section S4.10) and on the average impact of the other papers
published by the same scientist $\langle c_{10}^{-*} \rangle$ (see
S4.10). The model prediction for $\langle \log c_{10}^{*} \rangle (N)$
and $\langle \log c_{10}^{*} \rangle (\langle c_{10}^{-*} \rangle)$ is
in excellent agreement with the data (Fig. 3, C and D, and fig. S30),
indicating that the hidden parameter Q and variations in the
productivity N can explain the empirically observed impact differences
between scientists, correcting the shortcomings of the R-model.

In summary, the Q-model allows us to generate 
synthetic sequences of publications, by assigning 
to each scientist an individual parameter Q and 
a productivity N, extracted from the distribution 
P(Q, N). Each paper in the sequence is assigned an impact calculated
as $p \times Q$, where p is randomly drawn from the distribution P(p),
identical for all scientists.
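
The generative recipe summarized above can be sketched in a few lines. This is a minimal illustration, not the authors' code: it assumes natural logarithms, takes the mean vector and covariance matrix directly from Eq. 2, and rounds the sampled productivity to an integer number of papers. The function name `synthetic_career` is hypothetical.

```python
import numpy as np

# Parameters reported in Eq. 2, ordered as (p, Q, N) in log space.
# Assumption: natural logarithms are used throughout.
MU = np.array([0.92, 0.93, 3.34])
SIGMA = np.array([
    [0.93, 0.00, 0.00],
    [0.00, 0.21, 0.09],
    [0.00, 0.09, 0.33],
])


def synthetic_career(rng):
    """Return (Q, impacts) for one synthetic scientist."""
    # Draw (log Q, log N) jointly from the marginal P(Q, N).
    log_q, log_n = rng.multivariate_normal(MU[1:], SIGMA[1:, 1:])
    q = float(np.exp(log_q))
    # Rounding N to at least one paper is our choice, not the paper's.
    n_papers = max(1, int(round(float(np.exp(log_n)))))
    # p is uncorrelated with (Q, N), so it comes from the universal P(p).
    p = rng.lognormal(mean=MU[0], sigma=np.sqrt(SIGMA[0, 0]), size=n_papers)
    return q, q * p  # Eq. 1: impact = Q * p


rng = np.random.default_rng(0)
q, impacts = synthetic_career(rng)
print(f"Q = {q:.2f}, N = {impacts.size}, best paper = {impacts.max():.1f}")
```

Because the off-diagonal terms involving p vanish in Eq. 2, sampling p separately from (Q, N) is equivalent to sampling the full trivariate normal.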


The measurement and accuracy 
of the hidden parameter Q 


The model allows us to calculate the parameter $Q_i$ from the sequence
of publications $c_{10,i\alpha}$ of each scientist (section S4.5),
obtaining for large $N_i$ (see eq. S28 for finite $N_i$ and fig. S28
for the relation between the two estimations of the Q parameter)


$$Q_i = e^{\langle \log c_{10,i} \rangle - \mu_p} \qquad (3)$$


Given its dependence on $\log c_{10}$, Q is not dominated by a single
high-impact (or low-impact) discovery but captures instead a
scientist’s sustained ability to systematically turn her projects into
high-impact (or low-impact)
publications. For example, although the three 
scientists in Fig. 3E have the same productivity 
N ~ 100, Eq. 3 predicts widely different Q values 
for them, namely, Q = 9.99, 3.31, and 1.49. These 
values accurately reflect persistent differences in 
their sequence of publications: The Q = 9.99 
researcher consistently publishes high-impact 
papers, whereas the publications of the Q = 1.49 
researcher are consistently of limited impact. 
Hence, the parameter Q captures a scientist’s 
differentiating ability to take random projects 
p and systematically turn them into high-impact 
(or low-impact) publications. 
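
To make the robustness claim concrete, here is a minimal sketch of the large-N estimate in Eq. 3. It assumes natural logarithms, uses the value of μ_p reported in Eq. 2, and requires strictly positive citation counts (how zero-citation papers are treated is not stated here, so that handling is an assumption). The name `estimate_q` is hypothetical.

```python
import numpy as np

MU_P = 0.92  # mean of log p, taken from Eq. 2


def estimate_q(c10, mu_p=MU_P):
    """Large-N estimate of Q from Eq. 3: exp(<log c10> - mu_p)."""
    c10 = np.asarray(c10, dtype=float)
    if c10.size == 0 or np.any(c10 <= 0):
        # Assumption: zero-citation papers are handled upstream.
        raise ValueError("c10 must be a non-empty array of positive values")
    return float(np.exp(np.mean(np.log(c10)) - mu_p))


# A single very highly cited paper barely moves Q, because the
# estimate averages log-impacts rather than raw impacts.
steady = [30.0] * 50
one_hit = [30.0] * 49 + [3000.0]
print(estimate_q(steady), estimate_q(one_hit))
```

In this toy career, one paper with a hundred times the usual impact raises the estimate by less than 10%, whereas it would roughly triple the mean raw citation count.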

The Q-model makes the unexpected prediction that despite the obvious
differences in individual career paths, differences in the impact of
individual papers should disappear if we use the reduced variable
$p_{\alpha} = c_{10,i\alpha}/Q_i$, a rescaling standard in statistical
physics (28, 29). Although the individual $P(c_{10,i})$ distributions
differ greatly, the $P(c_{10,i}/Q_i)$ distributions for all scientists
collapse into a single universal curve P(p) (Fig. 4B), confirming the
universal nature of impact across all careers (30). Finally, the
log-normal P(Q) (Fig. 3G) confirms the model’s mathematical
self-consistency.

A fundamental limitation of all metrics used in science is their
nonstationarity: Productivity, the cumulative number of citations, and
the h-index all grow in time, making it difficult to compare
individuals at different stages of their career. In contrast, we find
that the Q parameter is independent of the career stage. To show this,
we used a ΔN = 30 paper window to measure changes in Q during the
career of a scientist, observing that the Q parameter fluctuates
narrowly throughout each career, without systematic changes (Fig. 5A).
The magnitude of these fluctuations is explained for 75% of scientists
by the stochastic nature of Q (section S4.9), because the estimated Q
parameter lies within the uncertainty envelope provided by the model.
In the remaining 25% of the cases, the variation in Q is slightly
higher than the variation predicted by the stochastic nature of the
model (Fig. 5B). However, the magnitude of this surplus variation
never exceeds 15%, and the average relative error is always below 10%
(section S4.9 and figs. S31 and S32).

Finally, to test the stability of the Q parameter throughout the
overall career, and not as a function of productivity, N, we consider
careers with at least 50 papers and calculate their early and late Q
parameters ($Q_{\mathrm{early}}$ and $Q_{\mathrm{late}}$,
respectively) using Eq. 3 on the first and second half of their
papers, respectively. In this case, the stochastic uncertainty
explains the differences between $Q_{\mathrm{early}}$ and
$Q_{\mathrm{late}}$ for the large majority of scientists (95.1%,
Fig. 5C). Together, these measurements indicate that the Q parameter
is generally stable throughout a career, allowing
Fig. 4. Careers and their Q parameter. (A) Left: Analytically
predicted cumulative impact distributions for different Q. The plot
also highlights the impact distribution of the three scientists shown
in Fig. 3E. The detailed publication record of each scientist is
reported on the right, documenting the notable differences between
them, given their different Q. (B) Left: Individual cumulative impact
distributions $P(c_{10,i})$. Given the modest number of publications N
characterizing most scientists and the impossibility to compute
statistically meaningful distributions for many of them, each
distribution is computed across all publications of all scientists
with the same $Q_i$. The color code captures their Q parameter, as
shown in (A). Right: Cumulative distributions of the rescaled impact
$c_{10,i}/Q_i$ for the scientists, indicating that the individual
distributions collapse on the universal distribution P(p).

us to offer quantitative predictions on the evo- 
lution of a scientific career. 
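
The early-versus-late comparison described above can be sketched as follows. This is an illustrative reconstruction under stated assumptions: Q is computed with Eq. 3 on each half of the paper sequence, the null model shuffles the order of the same papers, and "within the fluctuations" is read as falling inside the central 95% of the shuffled values. The function names and the 95% threshold are our choices, not taken from the source.

```python
import numpy as np


def q_from_papers(c10, mu_p=0.92):
    """Eq. 3 applied to a block of papers (all c10 assumed positive)."""
    return float(np.exp(np.mean(np.log(c10)) - mu_p))


def early_late_test(c10, n_shuffles=1000, seed=0):
    """Compare log(Q_early / Q_late) with a shuffled-order null model."""
    c10 = np.asarray(c10, dtype=float)
    half = c10.size // 2

    def log_ratio(seq):
        return np.log(q_from_papers(seq[:half]) / q_from_papers(seq[half:]))

    observed = log_ratio(c10)
    rng = np.random.default_rng(seed)
    null = np.array([log_ratio(rng.permutation(c10))
                     for _ in range(n_shuffles)])
    # Assumption: "stable" means inside the central 95% of the null.
    lo, hi = np.percentile(null, [2.5, 97.5])
    return float(observed), (float(lo), float(hi)), bool(lo <= observed <= hi)


rng = np.random.default_rng(42)
career = 5.0 * rng.lognormal(0.92, np.sqrt(0.93), size=60)  # constant Q = 5
print(early_late_test(career))
```

Note that μ_p cancels in the ratio, so the test depends only on the ordering of the papers, which is exactly what the shuffled null model probes.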


The predictive power of the hidden 
parameter Q 


The true value of the Q parameter comes in its 
predictive power: 

(i) The Q parameter allows us to estimate the number of papers a
scientist needs to write so that her highest-impact paper gathers
$c_{10}^{*}$ citations (Fig. 6B). We find that scientists with low Q
(≈1.2) must write at least 100 papers so that one of them gathers on
average 30 citations. Yet, a scientist with the same productivity but
Q = 10 is expected to author a $c_{10}^{*} = 250$ paper. Doubling
productivity adds only seven citations to the highest-impact paper of
the low-Q scientist (Q = 1.2), whereas it adds more than 50 citations
to $c_{10}^{*}$ for the high-Q scientist. Overall, Fig. 6B documents
that for low-Q scientists, increased productivity cannot substantially
boost the chance of publishing a high-impact work; hence, it is very
unlikely that they “get lucky.”

(ii) A scientist’s h-index, indicating that her h most-cited papers
gather at least h citations (12, 15), is jointly determined by the Q
parameter and the productivity N (section S4.11). This analytical
prediction reproduces not only the observed h-index of all scientists
(fig. S33B) but also the evolution of the h-index during a scientist’s
career (Fig. 6, C and D, and fig. S34A). Similar equations describe
the cumulative number of citations (Fig. 6D and figs. S33, D to F, and
S34B) and the g-index (section S4.11), indicating that the traditional
performance measures are uniquely determined by Q. Given that Q is
constant in time, we conclude that only productivity can account for
career-wide changes in these measures (Fig. 6, C and D).

(iii) By determining the value of Q during the early stages of a
scientific career, we can use it to predict future career impact. The
estimation error ΔQ of Q decreases with the number of published papers
N and drops below 10% already after N = 20 publications (section
S4.12). We can therefore estimate Q based on a scientist’s first $N_0$
published papers in Eq. 3 and then use the analytical expression of
the h-index and of the total number of citations to predict the future
impact of a scientist (section S4.12 and fig. S35). Given the
stochastic nature of the Q-model, an uncertainty envelope accompanies
the most likely value of each impact metric. In Fig. 6E, for two
scientists, we show the h-index prediction up to N = 150 after we
estimated Q from the first $N_0 = 20$ (top) and $N_0 = 50$ (bottom)
papers. Although the initial h-index overlaps for the two scientists,
their long-term impact diverges, a difference accurately predicted by
the Q-model. Generalizing for a larger sample of scientists, we find a
strong correlation between the predicted and observed h-index
(Fig. 6F). To quantify the Q-model’s overall predictive accuracy, we
measured the fraction of times that the h-index falls within the
envelope for scientists with at least 100 papers. The $z_N$ score for
each scientist captures the number of SDs the real h-index deviates
from the most likely h-index after N publications. We find
that 71% of scientists have 249 < 2 based on No = 
20, which improves to 81% for No = 50 and 2% 
(Fig. 6G). Together, we conclude that the es- 
timation of the Q parameter at early stages 
has the capability to unveil the long-term career 
impact. 

(iv) To test whether $Q_i$ correlates with outstanding impact, we
ranked scientists on the basis of Q, N, $C_{\mathrm{tot}}$,
$c_{10}^{*}$, and their h-index. To validate these rankings, we use a
receiver operating characteristic (ROC) plot that measures the
fraction of Nobel laureates at the top of the ranked list (Fig. 6A).
We find that the Q-based ranking predicts Nobel-winning careers most
accurately, offering the highest area under the curve of all ranking
measures (Fig. 6A) and the highest precision and recall (section S7
and fig. S45). Equally notable is the finding that the predictive
powers of $C_{\mathrm{tot}}$, $c_{10}^{*}$, and the h-index are
indistinguishable from each other and that the productivity N is the
least predictive. Similar results are obtained if we use $Q_i$ to
detect Dirac and Boltzmann medalists (figs. S46 and S47). The
early-career Q also has the best accuracy in predicting Nobel
laureates (section S7.1 and fig. S48).
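
Prediction (i) above lends itself to a quick numerical check. The sketch below is a Monte Carlo illustration, not the analytical expression of the Supplementary Materials: it assumes natural logarithms, draws p from the log-normal P(p) with the parameters of Eq. 2, and uses the median over simulated careers as the "typical" highest-impact paper. The function name `typical_best_paper` and the choice of the median are ours.

```python
import numpy as np

MU_P, VAR_P = 0.92, 0.93  # log-space mean and variance of p (Eq. 2)


def typical_best_paper(q, n_papers, n_careers=5000, seed=0):
    """Median highest-impact c10 over simulated careers with fixed Q, N."""
    rng = np.random.default_rng(seed)
    p = rng.lognormal(MU_P, np.sqrt(VAR_P), size=(n_careers, n_papers))
    # Eq. 1: each paper's impact is Q * p; keep the best one per career.
    return float(np.median(q * p.max(axis=1)))


for q in (1.2, 10.0):
    base = typical_best_paper(q, 100)
    doubled = typical_best_paper(q, 200)
    print(f"Q = {q}: N = 100 -> {base:.0f}, N = 200 -> {doubled:.0f}")
```

Because the best paper scales linearly with Q, the same doubling of productivity yields a gain in citations that is larger for the high-Q scientist by the ratio of the two Q values; the printed values can be compared with the figures quoted in (i).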

High-impact discoveries often result from collaborative work (31-33),
mixing scientists with different $Q_i$. To explore the influence of
collaborators (34, 35), we used a credit allocation algorithm (22, 36)
to attribute different impact shares to each author. We then repeated
our entire analysis, finding that the Q-model, with slightly revised
parameters, can explain the results (section S6.1 and figs. S40 to
S43). Further, we find that $Q_i$ is robust to the omission of
individual collaborators (section S6.2 and fig. S44). Hence, although
collaborative and team effects modulate the success of a particular
publication, individual collaborators have only limited influence on
$Q_i$.


Summary and discussion 


In summary, we offer empirical evidence that 
impact is randomly distributed within the se- 
quence of papers published by a scientist, im- 
plying that temporal changes in impact during 
a scientific career can be explained by temporal 
changes in productivity, luck, and the heavy- 
tailed nature of a scientist’s individual impact 
distribution. This finding allowed us to system- 
atically untangle the role of productivity, luck, 
and a scientist’s Q, predicting that truly high- 
impact papers require a combination of high 
Q and luck (high p) and that high productivity 
alone has only a limited effect on the like- 
lihood of high-impact work in a scientific career, 
if it is not associated with high Q. The mea- 
surable Q parameter represents a scientist’s 
sustained ability to publish high-impact (or 
low-impact) papers. 

Virtually all currently used metrics of performance change during the
career of a scientist, capturing progression, not sustained ability.
In contrast, Q is constant throughout a scientist’s 
career for most scientists (76%), and it is not dom- 
inated by a single paper or collaborator, being 
a measure intrinsically linked to an individual. 
The fundamental nature of the Q parameter is 
supported by the fact that the currently used 
metrics of success, from the h-index to cumulative 
citations, can be calculated from it. Q predicts not 
only the value but also the time evolution of the 
traditional impact metrics (Fig. 6, C to F). 

All findings presented above are based on a subset of 2887 physicists
with a career spanning at least 20 years and a persistent publication
record. These scientists have reached a mid- or late-career stage and
hence can be considered successful as they survived many selection
processes in academia. Although our findings hold in at least six more
disciplines (see section S1.2) and are robust to relaxing the
selection criteria (see section S1.4), the studied data sets do not
feature young scientists who have left academia early and hence have
published only a few papers.

Throughout this work, we have treated long-term impact, as captured by
$c_{10}$, as an exogenous variable. It seems reasonable, however, that
productivity and impact could influence each other. From a mechanistic
perspective, for example, some early promising publications might help
attract the resources leading to further productivity growth.
Early-career impact, quantified with the average
$\langle c_{10} \rangle$ for the first 10 papers of a scientist, is
associated with career longevity, indicating that
the probability to stay in academia is slightly 
influenced by the impact of a scientist’s early 
publications (fig. S49). The Q-model also indi- 
cates that the overall number of papers in a 
career weakly correlates with high Q (Eq. 2). 
Although the Q-model and the predictions pro- 
vided here are immune from a possible coupling 
between early impact and overall productivity 
(section S5), these preliminary findings call for 
more measurements and models that can accu- 
rately capture the coevolution of short-term early 
impact and productivity (37). 

Although Q can accurately predict a career 
impact, the dependence of Q on exogenous fac- 
tors, such as the quality of the education and cur- 
rent institution (38, 39), size of the research 
community (24, 40), gender (41, 42), dynamics 
of subfields (43, 44), or publication habits, re- 
mains unknown. Mathematically speaking, the 
model remains the same if the Q parameter re- 
flects other factors that characterize a scientist. 
The various robustness checks we performed to 
discover possible confounding factors, such as 
career length, decade, team effects, and the analysis 
fluctuations are because we have a finite number of papers in the
moving window, the magnitude of the changes being comparable to that
predicted by the model with a constant Q (section S4.9).
(B) Fluctuations of the Q parameter in model and data. We study the
distribution of the uncertainty, [...] synthetic careers with constant
Q (ΔN = 5). For 74.7% of the scientists, the fluctuations are
comparable to those of the model. For the remaining 25.3%, the SD is
slightly higher than the one predicted by the model. (C) Comparison
between early and late Q parameter. We compare the Q parameter at
early-career ($Q_{\mathrm{early}}$) and late-career
($Q_{\mathrm{late}}$) stage of 823 scientists with at least 50 papers.
We measured the two values of the parameters using only the first and
second half of published papers, respectively. We perform these
measurements on the real data (circles) and on randomized careers,
where the order of papers is shuffled (gray shaded areas). For most of
the careers, 95.1%, the changes between early- and late-career stages
fall within the fluctuations predicted by the null model with
randomized paper order, indicating that the Q parameter is stable
throughout a career. The observed fluctuations are explained by the
finite number of papers in a scientist's career.
of different disciplines and data sets, have failed to offer a simple,
straightforward explanation for the origin of the different Q values
scientists have. Most likely, the Q parameter is affected by multiple
factors, rather than a single one, and more information about its
nature might be unveiled once other detailed career information, such
as grants and awards, becomes available and is included in the
analysis. Nevertheless, the key differentiating factor of Q from luck
is that it has to be sustained. Q is not determined by a single paper,
a lucky draw, but by a sustained high performance throughout the
scientist’s career. This is reflected in the
$\langle \log c_{10} \rangle$ term in Eq. 3, indicating that a single
very high impact paper has only a small effect on Q. A scientist needs
multiple high-$c_{10}$ papers to ensure a high Q. Uncovering the
origin of the Q parameter is a promising future goal, which not only
could offer a better understanding of the emergence and evolution of
scientific excellence but also might improve our ability to train and
nurture high-impact scientists.


Methods 
Data sets 


We explore two types of data sets: (i) the publication record of
236,884 physicists publishing in the journal family Physical Review
from 1893 to 2010 [American Physical Society (APS) data set, see
section S1.1 and figs. S1 and S2] and (ii) the combination of 24,630
Google Scholar career profiles with Web of Science (WoS) data,
covering 514,896 publications in biology, chemistry, cognitive
sciences, ecology, economics, and neuroscience (WoS data set,
described in section S1.2 and fig. S3). The results shown in this
article refer to 2887 scientists, whose publication record spans at
least 20 years, who have at least 10 publications and have authored at
least one paper every 5 years, derived from the APS data set (see
section S1.3).
Observed versus predicted growth of the h-index for scientists with
different Q. The plot documents the agreement between the analytically
predicted h-index (eq. S38, continuous line) and the observed value
$\langle h(N) \rangle$, obtained by averaging the h-index for
scientists with the same Q (circles). (D) Top: Growth of the h-index
for two scientists with at least 200 papers and different Q as a
function of the productivity N (blue circles), compared with the
prediction of eq. S38 (red line). Bottom: For the two scientists in
the top panels, we measure the cumulative number of citations as a
function of N, $C_{\mathrm{tot}}(N)$, and compare with the prediction
of eq. S39. The close agreement between observation and prediction in
(C) and (D) shows that the time-independent Q captures an intrinsic
property of a scientist and that other indicators, like the h-index or
cumulative citations, are uniquely determined by Q and productivity.
(E) For two scientists, we show the h-index prediction as a function
of N using only early-career information, namely, $N_0 = 20$ (top) and
$N_0 = 50$ (bottom), to estimate the Q parameter. Although the initial
h-index up to $N_0 = 20$ highly overlaps for the two scientists, their
long-term impact diverges, a difference accurately predicted by the
Q-model. (F) Scatterplots of predicted and real h-index at N = 60
based on Q estimated at $N_0 = 20$. The error bars indicate prediction
quartiles (25 and 75%) in each bin and are colored green if y = x lies
between the two quartiles in that bin and red otherwise. The circles
correspond to the average h-index in that bin. (G) The $z_N$ score for
each scientist captures the number of SDs the real h-index deviates
from the most likely h-index after N publications. $z_N < 2$ indicates
that the real data are within the prediction envelope.
Note that the APS data set contains only citations within the Physical
Review corpus (see section S1); for this reason, the specific citation
numbers are systematically smaller compared to the citations reported
by the WoS database.

Our findings are also supported by the anal- 
ysis of different samples of scientists in the APS 
data set, selected using a number of different 
criteria (see section S1.4), and by the analysis of 
all other disciplines in the WoS data set, which 
are reported in the Supplementary Materials 
and referenced throughout the article. 


Citation measures 


Citation-based measures of impact are affected by three major
problems: (i) citations follow different dynamics for different papers
(6, 45), (ii) the average number of citations changes over time (24),
and (iii) citation count is subfield-dependent (24). To overcome (i),
for each paper we use the cumulative number of citations the paper
received 10 years after its publication, $c_{10}$, as a measure of its
scientific impact (6, 45). We can correct for (ii) and (iii) by
normalizing $c_{10}$ by the average $\langle c_{10} \rangle$ of papers
published in the same year. Because these corrections do not alter our
conclusions for the APS data set, we report results without
normalization. For the WoS data set, we instead used normalized
citation counts.
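
The year-based normalization described above can be sketched as follows. This is a minimal illustration that assumes each paper is represented by a hypothetical (year, c10) tuple; it does not reproduce any subfield-specific grouping that the Supplementary Materials may apply.

```python
from collections import defaultdict


def normalize_by_year(papers):
    """Divide each paper's c10 by the mean c10 of its publication year.

    `papers` is a list of (year, c10) tuples; this layout is hypothetical.
    """
    totals = defaultdict(lambda: [0.0, 0])
    for year, c10 in papers:
        totals[year][0] += c10
        totals[year][1] += 1
    means = {year: s / n for year, (s, n) in totals.items()}
    # Years whose mean is zero cannot be rescaled; we return 0.0 there.
    return [c10 / means[year] if means[year] > 0 else 0.0
            for year, c10 in papers]


papers = [(1990, 10), (1990, 30), (2000, 40), (2000, 120)]
print(normalize_by_year(papers))  # [0.5, 1.5, 0.5, 1.5]
```

After rescaling, a paper with the average impact of its year has a normalized value of 1 regardless of how citation volumes changed between years.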


Q-model 


The stochastic process behind the Q-model is determined by the joint
probability P(p, Q, N). The model assumes that a scientist $i$ has a
productivity $N_i$ and a parameter $Q_i$ sampled from the marginal
distribution P(Q, N), and then extracts $N_i$ values of p from the
conditional distribution P(p|Q, N). By assuming that P(p, Q, N)
follows a trivariate log-normal distribution with parameters
$\boldsymbol{\mu}$ and $\boldsymbol{\Sigma}$, we can write the
likelihood function $\mathcal{L}_i$ that a scientist $i$ with $Q_i$
and $N_i$ has a sequence of papers $\{\alpha\}$ with impact
$\{Q_i p_{\alpha}\}$ (see Eq. 1). Finally, with numerical optimization
methods, we identify the maximum of the overall log-likelihood
function $\log \mathcal{L} = \sum_i \log \mathcal{L}_i$, which
provides the numerical estimate of $\boldsymbol{\mu}$ and
$\boldsymbol{\Sigma}$ reported in Eq. 2 (see also sections S4.3 and
S4.4). This approach also estimates Q, obtained by maximizing the
likelihood function $\mathcal{L}_i$ for each scientist. The
maximization provides an analytical expression for $Q_i$, which, for
large productivity $N_i$, converges to Eq. 3 (see section S4.5).

This procedure and the measured parame- 
ters allow us to generate synthetic sequences 
of publications: We first extract an individual 
parameter Q and a productivity N from the 
distribution P(Q,N). Then, each paper in the 
synthetic sequence is assigned an impact pQ, 
where p is randomly drawn from the distribution 
P(p), identical for all scientists. 
protein side-chain diversity


Tom H. Wright, Ben J. Bower, Justin M. Chalker, Goncalo J. L. Bernardes, 

Rafal Wiewiora, Wai-Lung Ng, Ritu Raj, Sarah Faulkner, M. Robert J. Vallée, 
Anuchit Phanumartwiwath, Oliver D. Coleman, Marie-Laétitia Thézénas, Maola Khan, 
Sébastien R. G. Galan, Lukas Lercher, Matthew W. Schombs, Stefanie Gerstberger, 
Maria E. Palm-Espling, Andrew J. Baldwin, Benedikt M. Kessler, 

Timothy D. W. Claridge, Shabaz Mohammed, Benjamin G. Davis* 


INTRODUCTION: Natural posttranslational modifications (PTMs) to
proteins expand the chemical groups available to proteins. The ability
to expand posttranslational functional group diversity in an unbounded
manner could, in principle, allow exploration and understanding of how
these groups modulate biological function. Natural PTMs feature bonds
to heteroatoms (non-carbon) made at the γ (Cys Sγ, Thr Oγ, Ser Oγ) or
ω (Lys Nω, Tyr Oω) positions of side chains. However, one of the
central features of biomolecules is C(sp³)-C(sp³) bond formation.
Because all amino acid side chains contain this C-C bond, mastering
its construction on proteins could allow free-ranging structural
alteration of residues in proteins (both natural and unnatural).


RATIONALE: In principle, C(sp³)-C(sp³) disconnections at the β,γ C-C
bond would allow the chemical installation of a wide range of amino
acid functionalities. Traditional two-electron chemistry (using
nucleophiles and electrophiles) requires reagents that are often
incompatible with biological substrates and/or water. Free radicals
can be tolerant of aqueous conditions and unreactive (and thereby
compatible) with the majority of functionality present in
biomolecules. We therefore reasoned that mild, carbon-centered free
radical chemistry would be enabled by matching free-radical reactivity
with a suitable, uniquely reactive functional group partner that
possesses a chemical affinity for such singly occupied molecular
orbitals. The amino acid residue dehydroalanine (Dha) can be readily
introduced in a site-selective manner genetically, biosynthetically,
or chemically; upon reaction with a suitable radical, Dha would
favorably generate a stabilized Cα radical 1. Suitable “quenching” of
the central Cα radical intermediate 1 generated after formation of the
critical C-C bond would thus allow “chemical mutation” of the side
chain.


RESULTS: A range of precursor halides (R-Hal, Hal = I or Br) allowed
the creation of radicals R•. These radicals reacted selectively with
Dha in peptides and proteins with excellent site selectivity and
regioselectivity (>98% β,γ) and typically with a diastereoselectivity
of ~1:1. Combined use of R-Hal with NaBH₄ under low-oxygen conditions
suppressed competing oxidation and disubstitution side reactions of
intermediates 1. This allowed for rapid reactions (typically 30 min)
with improved efficiency across a range of representative protein
types and scaffolds (all-α, α/β folds, all-β, receptor, enzyme,
antibody). The reactivity of primary, secondary, and tertiary alkyl
halides allowed installation of natural, simple hydrophobic residue side
chains. Charged or polar protic (e.g., OH, NH) 
functionality in amino acid side chains was also 
possible. Even the use of side-chain reagents 
in unprotected form proved possible, thus high- 
lighting not only exquisite chemoselectivity 
but also compatibility with common biological 
functional groups. These transformations en- 
abled the creation of a wide diversity of natural, 
unnatural, posttranslationally modified (methyl- 
ated, glycosylated, phosphorylated, hydroxylated) 
and labeled (fluorinated, isotopically labeled) 
side chains, as well as difficult-to-access but im- 
portant residues in proteins (e.g., methyl-Arg, 
citrulline, ornithine, methyl-Gln, phospho-Ser). 


CONCLUSION: This approach to chemical edit- 
ing of amino acid residues, outside of the rigid 
constraints of the ribosome and enzymatic proces- 
sing, may prove to be a general technology for ac- 
cessing diverse, previously unattainable proteins. 
C-C bond formation allows construction of many side chains, not just the modification of existing natural amino acid residues. t-Leu, tert-leucine; Orn,
Posttranslational mutagenesis:
A chemical strategy for exploring 
protein side-chain diversity 
Benedikt M. Kessler, Timothy D. W. Claridge,
Shabaz Mohammed, Benjamin G. Davis


Posttranslational modification of proteins expands their structural and functional 
capabilities beyond those directly specified by the genetic code. However, the vast diversity 
of chemically plausible (including unnatural but functionally relevant) side chains is not 
readily accessible. We describe C (sp3)-C (sp3) bond-forming reactions on proteins
under biocompatible conditions, which exploit unusual carbon free-radical chemistry,
and use them to form Cβ-Cγ bonds with altered side chains. We demonstrate how these
transformations enable a wide diversity of natural, unnatural, posttranslationally modified 
(methylated, glycosylated, phosphorylated, hydroxylated), and labeled (fluorinated, 
isotopically labeled) side chains to be added to a common, readily accessible 
dehydroalanine precursor in a range of representative protein types and scaffolds. 

This approach, outside of the rigid constraints of the ribosome and enzymatic processing, 
may be modified more generally for access to diverse proteins. 


Natural posttranslational modifications (PTMs)
to proteins partially expand the chemical
groups available to proteins, modulating
both structure and function (Fig. 1A) (1).
For example, protein glycosylation tunes 
both physical (e.g., solubility, stability, folding) 
and biological (e.g., immune response, cell ad- 
hesion events, signaling) activity (2). Phospho- 
rylation, one of the most frequently occurring 
posttranslational modifications, is widely used 
in nature as a powerful functional activation 
mechanism (as an “on switch”) for proteins (3). 
Even relatively small modifications such as meth- 
ylation have been shown to be critical in a range 
of pathways with diverse biological effects, such 
as the transcriptional regulation mediated by 
histone proteins (4). The ability to expand post- 
translational functional group diversity in an un- 
bounded manner could therefore, in principle, 
allow exploration and understanding of even 
greater and more diverse effects in modulation
of biological function. 

The vast majority of all known natural PTMs 
feature bonds to heteroatoms (non-carbon) made 
at the γ (Cys Sγ, Thr Oγ, Ser Oγ) or ω (Lys Nω,
Tyr Oω) positions of side chains (5). Yet one of
the central features of living “organic” matter 
is that it exploits carbon’s ability as an element 
to catenate [typically through C (sp3)-C (sp3)
bond formation], providing one of nature’s most 
important structural motifs. Because all amino 
acid side chains contain this bond, mastering 
its construction on proteins could allow free- 
ranging structural alteration of residues in pro- 
teins (both natural and unnatural) and thence 
functional reprogramming. Such extension of the 
chemical space accessible to protein engineering 
could be considered a near-unlimited form of syn- 
thetic biology, a form of “chemical mutagenesis” 
(6, 7). 

Site-directed mutagenesis has revolutionized 
the study and understanding of proteins (8, 9). 
This now long-standing technique, however, is 
generally restricted to the 20 natural amino acid 
building blocks by the high selectivity of natural 
aminoacyl tRNA synthetases and the limited 
plasticity of the ribosome (10), which creates an
effective “filter” to translation. The incorporation 
of unnatural amino acids expands a protein’s 
functional capacity and can provide insight into 
biochemical mechanisms (11-13). Some strategies
for the incorporation of noncanonical residues
have emerged as a powerful route to unnatural 
mutant proteins. Biological techniques such as 
amber codon suppression (14), while useful, remain
limited in scope of structural variation (13)
by the tolerance of the translational machinery 
and hence must be optimized on a case-by-case 
basis (12). The total or partial synthesis of pro- 
teins has been made possible by powerful native 
chemical ligation techniques (15). However, there 
remain restrictions on the size of the proteins 
that can be readily synthesized; such syntheses 
typically require many steps followed by correct 
refolding, and hence expertise is required for all 
but the simplest protein targets (16). 

An alternative, divergent, and potentially un- 
limited approach would be to incorporate a sin- 
gle amino acid that can act as a general chemical 
precursor for any desired side chain, whether 
natural or unnatural (17). The introduction of various
chemical “tags” (18) that allow selective protein
modification via reactivity compatible with
that of natural biomolecules (19) has been an im- 
portant step toward this goal. However, current 
protein modification approaches rely on unnatural 
carbon-heteroatom linkages (20) that do not con- 
struct the C-C framework found in biology and 
thus cannot be used for the site-selective intro- 
duction of natural posttranslational modifications 
and their modified variants (19). Thus far, the
formation of carbon (sp3)-carbon (sp3) bonds for 
protein modification has remained out of reach, 
despite the ubiquity of the C-C bond in amino 
acid side chains. Access to such reactivity would 
enable the rapid and divergent exploration of both 
natural and unnatural “side chain” space from 
a readily accessible precursor. 

Here, we demonstrate that such C (sp3)-C (sp3)
bond formation is possible in a protein-compatible 
manner by exploiting the selectivity of carbon- 
centered radicals to allow a form of general post- 
translational mutagenesis (7, 17, 21). 


Design of a biocompatible C (sp3)-C (sp3)
bond-forming reaction 

We envisaged a unique strategy through retrosynthetic
analysis (Fig. 1B). In principle, C (sp3)-C (sp3)
disconnections at the β,γ C-C bond would
allow the chemical installation of not only nat- 
ural amino acid residues but also their posttrans- 
lationally modified variants and a wide range of 
unnatural amino acids. Traditional two-electron 
chemistry (using nucleophiles and electrophiles; 
Fig. 1B) results in consideration of reagents likely 
incompatible with biological substrates by virtue 
of their reactivity with water and/or the func- 
tional groups found in natural biomolecules. We 
considered that single-electron chemistry might 
prove more compatible with proteins, because 
suitable free radicals are tolerant of aqueous con- 
ditions (22) and are unreactive (and thereby com- 
patible) with the majority of existing functionality 
present in biomolecules [“bio-orthogonal” (19)]. 
We reasoned that use of such mild, carbon-centered 
free-radical chemistry (Fig. 1, B and C) would be 
enabled by matching free-radical reactivity with 
a suitable, uniquely reactive functional group 
partner that possesses a chemical affinity for such
singly occupied molecular orbitals (SOMOs). The 
amino acid residue dehydroalanine (Dha) can be 
readily introduced in a site-selective manner ge- 
netically (23, 24), biosynthetically (25), or chem- 
ically (26, 27) and is a potent “SOMOphile” (28), 
which, upon reaction with a suitable radical, 
would favorably generate a capto-dative stabilized
Cα radical (Fig. 1C and Fig. 2).


Development of a peptide-compatible
C (sp3)-C (sp3) bond-forming reaction
Our attention focused on methods for the ready 
generation of suitable carbon-centered free rad- 
icals and the suitable productive “quenching” of 
the central Cα radical intermediate 1 generated
after formation of the critical C-C bond (Fig. 2B). 
We considered that alkyl radicals might be de- 
rived from the corresponding alkyl halides (R- 
Hal) through processes that would generate 
single electron species either through direct 
homolytic bond fission (e.g., of the C-Hal bond) 
or through single-electron transfer from metals 
with suitable redox potentials in their low valence
states (29, 30) [e.g., zinc (31, 32) Zn⁰ or indium
(33) In⁰] followed by halide anion loss.

We chose leucine (Leu), an amino acid res- 
idue that has a widespread occurrence in pro- 
teins, as an initial test side-chain system, which 
we could potentially generate from the readily 
available simple organic compound isopropyl io- 
dide as a precursor (Fig. 2A, R = iPr). Pleasingly, 
reaction of Dha-containing derivatives and peptides
as small-molecule models under Zn⁰-mediated
conditions in aqueous buffer [pH 5 to
6, NH4Cl (aq)] afforded the corresponding Leu
derivatives directly (fig. S1 and supplementary 
materials). Generation of Dha from cysteine (Cys) 
followed by conversion to Leu thus allowed the 
overall “chemical mutation” of a residue (Cys →
Leu) inside an intact peptide backbone; under
optimized conditions (see supplementary mate- 
rials), this could be achieved in >90% yield and 
in less than 30 min. 
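
As a rough illustration of the mass changes such a Cys → Dha → Leu conversion would be expected to show in mass spectrometric analysis, the following Python sketch computes monoisotopic residue masses from elemental formulas. The residue compositions and atomic masses are standard values; the table and function names are illustrative assumptions and are not taken from the supplementary materials.

```python
# Monoisotopic atomic masses (Da) for the elements needed here.
MONO = {
    "C": 12.0,
    "H": 1.00782503,
    "N": 14.0030740,
    "O": 15.9949146,
    "S": 31.9720707,
}

# Elemental compositions of in-chain residues (amino acid minus H2O).
RESIDUES = {
    "Cys": {"C": 3, "H": 5, "N": 1, "O": 1, "S": 1},
    "Dha": {"C": 3, "H": 3, "N": 1, "O": 1},
    "Leu": {"C": 6, "H": 11, "N": 1, "O": 1},
}


def residue_mass(name):
    """Monoisotopic mass of one residue, summed from its formula."""
    return sum(MONO[element] * count for element, count in RESIDUES[name].items())


def mass_shift(start, end):
    """Mass change (Da) when residue `start` is converted to residue `end`."""
    return residue_mass(end) - residue_mass(start)


if __name__ == "__main__":
    for start, end in [("Cys", "Dha"), ("Dha", "Leu"), ("Cys", "Leu")]:
        print(f"{start} -> {end}: {mass_shift(start, end):+.4f} Da")
```

Under these assumptions the Cys → Dha step corresponds to loss of H2S and the Dha → Leu step to a net gain of C3H8 (the isopropyl group plus one hydrogen), which is the kind of shift that peptide mapping would be expected to localize to the tagged site.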

Extension of this methodology to a range of 
alkyl halides (R-Hal) enabled the synthesis of a 
variety of natural and unnatural amino acid res- 
idues, including unnatural aliphatic and cyclic 
structures (fig. S2). Notably, not only were pri- 
mary, secondary, and tertiary alkyl halides all 
tolerated, allowing installation of the natural 
simple and hydrophobic residue side chains, so 
too was the presence of polar protic (e.g., hy- 
droxyl and amine) functionality common in ami- 
no acid side chains. The use of these side-chain 
reagents proved possible even without protec- 
tion, thus highlighting not only exquisite chemo- 
selectivity but also compatibility with common 
biological functional groups and hence biological 
compatibility (orthogonality). Full characterization 
of the adducts (see fig. S1 and supplementary 
materials) confirmed absolute regioselectivity 
(>98% with diastereomeric ratio 55:45) for 
the radical addition, consistent with the de- 
signed, matched polarities of the radicals and 
the corresponding radical acceptor Dha, re- 
spectively (34). 
Fig. 1. Design of a biocompatible C (sp3)-C (sp3) bond-forming reaction as part of a general strategy
for chemical mutagenesis. (A) Natural posttranslational modification exploits C-heteroatom bond
formation, such as C-N bond formation in Lys methylation, to diversify protein structure and hence
modulate function. (B) Modification in a protein after translation, akin to PTM formation, but using C-C
bond formation would allow construction of many side chains, not just the modification of existing natural
amino acid residues. Retrosynthetic analysis (blue arrow) suggests a C-C β,γ bond disconnection and
reveals several possible “synthons.” For heterolytic two-electron reactions (shown in red), the resulting
disconnection yields “synthetic equivalents” R⁺ or R⁻ or reagents that would react and/or quench with the
water in the necessary aqueous solvent or with example protein side chains shown. However, for the
homolytic one-electron reaction (shown in blue), the resulting disconnection yields a free-radical “synthon”
that would allow an equivalent R• for which compatibility can be envisaged. (C) A functional group with
affinity for carbon-centered free radicals that was suitably polarized could act as the “synthetic equivalent”
corresponding to the other protein-based “synthon” arising from this homolytic disconnection. The residue
dehydroalanine (Dha) is the functional group proposed as a radical reactive “synthetic equivalent” that would
allow corresponding C-C β,γ bond formation.


We observed that the predicted Cα radical
intermediate 1 (Fig. 2B), as well as displaying
advantageous stability that would favor initial
reaction, was sufficiently long-lived to allow further
reaction with other radicals in unproductive
and unwanted termination reactions (fig. S3).
For example, reaction with a second alkyl
radical afforded disubstituted (“dialkylated”) products,
whereas reaction with molecular oxygen
(which in its natively abundant state is the triplet
form ³O2 that may react with radicals) led to
apparent oxidative protein cleavage (35). Notably,
additional amounts of reagents were seemingly
needed for full conversion because of such
A Csp3-Csp3 bond-forming reaction for chemical protein modification
Fig. 2. Proposed radical reaction to allow posttranslational mutagenesis and suggested mechanism.
(A) The required “side-chain free-radical” synthetic equivalent could be generated from a suitable
radical precursor R-X and then reacted with the radical-reactive SOMOphile residue dehydroalanine
(Dha) as a privileged unnatural amino acid “tag” to allow site-selective “chemical mutagenesis.” (B) This
would generate the capto-dative stabilized intermediates 1. The proposed mechanism of free-radical R•
generation illustrates the paths to efficient C-C bond-forming chain reaction and desired product (in
blue) and competing side reactions and unwanted products (in red).


observed competing processes (reduction of alkyl 
halide to alkane as well as disubstitution and 
oxidative degradation). The successful develop- 
ment of a radical reaction for protein modifi- 
cation (Fig. 2A) therefore necessitated a means 
of eliminating and controlling these undesired 
pathways. 


Mechanism-guided reaction
development allowed optimization of
protein-compatible C (sp3)-C (sp3)
bond formation

Next, two initial model proteins were selected to
test radical reactivity on more complex extended
polypeptides: a highly ordered three-layer α/β
Rossmann-fold serine protease [subtilisin from
Bacillus lentus (SBL)] and a three-α-helix protein
representative of the histone fold that
contains both ordered and disordered motifs
(histone H3). Dha was installed (27, 36) site-selectively
from corresponding single cysteine
variants (see supplementary materials) to generate
radical acceptor sites at position 156 in
SBL (SBL-Dha156) and at three separate sites
(9, 27, and 64) in histone H3 (H3-Dha9, H3-Dha27,
H3-Dha64). These allowed us to test alterations
of the protein scaffold as well as variation of the
reaction site within the same protein scaffold.
Under essentially identical conditions to those
used on small-molecule systems (isopropyl iodide,
Zn⁰, aqueous ammonium acetate buffer,
pH 6), we observed direct Dha → Leu “chemical
mutation” (fig. S4) in both SBL and histone
H3. However, additional side products
were also detected. Careful isolation, trapping, 
and characterization of these side products (figs. 
S5 to S8) revealed that they were the product of 
two competing pathways, both of which are 
consistent with the long-lived intermediacy of 
the Cα capto-dative radical formed after addition
of R• to Dha. The observed oxidative cleavage
products, C-terminal amide 2 and dicarbonyl 3
(figs. S5 to S7), would arise from the termination
reaction of the Cα radical with triplet oxygen
(Fig. 2B) (37). Disubstituted (“dialkylated”) products
4 were observed to arise from the termination
reaction of the Cα radical with R•.
Consistent with these analyses, “peptide map- 
ping” [tryptic tandem mass spectrometry (MSMS)] 
confirmed clean site selectivity of both the de- 
sired mutations and these side reactions: No 
residues other than that determined by the Dha 
“tag” site were identified (fig. S8). Notably, these
results were also wholly consistent with the
corresponding Cγ- and Cα-disubstituted products
observed under comparable conditions from small-molecule
models (see above) (fig. S3). These observations
in proteins therefore were also consistent
with radical addition at Cγ followed by termination
of the Cα radical intermediate thus generated
(Fig. 2B).

The generality of these parallel processes— 
chemical mutation with competing oxidative 
cleavage and disubstitution—was confirmed 
by its observation in both of the scaffolds (SBL 
and histone H3) and at several sites (e.g., Lys9,
Lys27, and Lys64 within histone H3). These results
not only highlight the need for an improved reaction
with better control of radical addition,
they also provide compelling chemical evidence
for the intermediacy of the proposed Cα radical
intermediate (Fig. 2B) and hence the (partial 
and initial) success of our designed C-C radical- 
forming reaction in proteins. 

These methodological observations and mech- 
anistic rationalization allowed us in turn to op- 
timize the balance between the desired radical 
side-chain addition and the unwanted competing 
side reactions (oxidative cleavage and disubsti- 
tution). In our initial reaction systems, our use of 
metal-mediated single-electron transfer exploited 
a system that relies on the redox potentials of the 
metals that were used not only in the initiation 
step but also in the subsequent second-electron 
transfer, which creates an enolate that is quenched 
by protonation (likely from solvent) (see fig. S9). 
In principle, more effective second-electron trans- 
fer to enhance enolate formation would allow 
more rapid formation of desired product at the 
expense of side reactions (quenching of the intermediate
Cα radical before side reaction). Survey
of redox potentials (38, 39) suggested various
metal potentials that might prove useful; of those
that are compatible with water, indium suggested
itself as a strong alternative candidate
to zinc (standard reduction potentials E°/V: Zn⁰/²⁺
−0.76; In⁰/⁺ −0.14; In⁺/²⁺ −0.40; In²⁺/³⁺ −0.49). This
tuning of the single-electron donor, through the
use of In⁰ instead of Zn⁰, proved partially successful,
yielding cleaner and more effective chemical
mutation through radical addition (see fig.
S10 and supplementary materials) on several but
not all substrates. In particular, certain primary 
iodides (e.g., the side chains of MeArg and MeLys, 
where Me = methyl) were so reactive that un- 
wanted disubstitution side products remained. 
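
To make the comparison of donor strengths concrete, the short Python sketch below converts the standard reduction potentials quoted above into standard free energies for the corresponding oxidation (electron-donating) half-reactions, using ΔG° = -nFE° for a reduction and reversing the sign. The electron count assigned to each couple is an assumption inferred from the oxidation-state labels, and the result is purely thermodynamic (relative to the standard hydrogen electrode); it says nothing about kinetics at a metal surface or in buffer.

```python
FARADAY = 96485.332  # Faraday constant, C/mol

# (E° in V as quoted in the text, assumed number of electrons transferred)
COUPLES = {
    "Zn(0)/Zn(II)": (-0.76, 2),
    "In(0)/In(I)": (-0.14, 1),
    "In(I)/In(II)": (-0.40, 1),
    "In(II)/In(III)": (-0.49, 1),
}


def oxidation_free_energy_kj(e_reduction, n_electrons):
    """Standard free energy (kJ/mol) of the oxidation half-reaction.

    Reduction: dG = -n F E.  Oxidation is the reverse, so dG = +n F E.
    A more negative value means a thermodynamically stronger electron donor.
    """
    return n_electrons * FARADAY * e_reduction / 1000.0


if __name__ == "__main__":
    for couple, (e_red, n) in COUPLES.items():
        dg = oxidation_free_energy_kj(e_red, n)
        print(f"{couple:>15}: E = {e_red:+.2f} V, n = {n}, dG(ox) = {dg:+.1f} kJ/mol")
```

On this simple measure zinc is the stronger overall reductant, whereas indium offers several milder one-electron steps, which is consistent with the idea of tuning the donor rather than simply maximizing its strength.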

Next, we considered alternative methods for 
“quenching” of the intermediate Cα radical. In
principle, direct hydrogen atom (“radical hydride”) 
transfer (Fig. 2B) would not only enhance desired 
product formation but, by building a suitable 
chain reaction, could prove more efficient and 
sustainable. However, analysis of the putative 
chain cycle (Fig. 2B) highlighted that this would 
require selective increase of the rate of this step 
(encompassed by k1app) to a greater extent than
that of not only the oxidative and disubstitution
side reactions (k2app and k3app, respectively) but
also over that of the direct reduction of alkyl 
Fig. 3. Biocompatible C (sp3)-C (sp3) bond-formation allows wide-ranging chemical mutagenesis.
Application of the optimized bond-forming reaction allows direct installation of side chains
onto Dha tags found in multiple, representative protein scaffolds and at different sites
within the same scaffolds. Attachment of polar, hydrophobic, nonpolar, ionized, and
modified side chains with natural motifs (black), with natural modifications (blue)

motifs (red) were all
iodide (R-I → R-H, k0app) (Fig. 2B). A range of
traditional “radical hydride” sources were screened
(e.g., R3SnH, R3SiH, RSH), yet none proved useful
and all appeared to favor k0app over k1app. We
reasoned that these bulkier hydride sources
preferentially transfer hydride H• to less bulky
radicals (such as direct transfer to R•, thereby
favoring k0app), and so we next tested less hindered
hydride sources (which might be able to
access the more hindered intermediate Cα radical
1, as desired). Although borohydrides RBH3
are traditionally viewed as nucleophilic hydride
sources, rare studies on these (40-42) and related
aluminum hydrides (43) have previously
suggested possible radical behavior under certain
circumstances (44). Strikingly, we found that
NaBH4 in aqueous solution proved to be a highly
effective reagent, allowing improved efficiency 
(as judged by the need for reduced equivalents 
of iodides) and by the quality of protein chem- 
ical mutation products (fig. S11). The radical 
nature of this controlled, clean, and efficient re- 
action was confirmed not only by direct obser- 
vation of radicals by electron paramagnetic 
resonance (EPR) (fig. S12) but also through the
use of radical trapping; the reaction was fully 
inhibited by substoichiometric 4-hydroxy-TEMPO 
(4-hydroxy-2,2,6,6-tetramethylpiperidin-1-oxyl) or 
acrylamide (fig. S13 and supplementary materials). 
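
The competition described above can be sketched with a minimal kinetic model. The Python snippet below assumes, purely for illustration, that the Cα radical intermediate 1 is consumed by three parallel, irreversible, pseudo-first-order channels: productive hydrogen atom transfer (taken here as k1app), oxidative cleavage (k2app), and disubstitution (k3app). The numerical rate constants are hypothetical, not measured values, and the direct reduction of the alkyl iodide is left out because it consumes R• rather than intermediate 1.

```python
def branching_fractions(k1_app, k2_app, k3_app):
    """Fraction of the C-alpha radical intermediate consumed by each pathway.

    Assumes three parallel, irreversible, pseudo-first-order channels, so each
    fraction is simply that channel's rate constant over the sum of all three.
    """
    total = k1_app + k2_app + k3_app
    if total <= 0:
        raise ValueError("at least one rate constant must be positive")
    return {
        "product": k1_app / total,
        "oxidative_cleavage": k2_app / total,
        "disubstitution": k3_app / total,
    }


if __name__ == "__main__":
    # Hypothetical rate constants in arbitrary units.
    slow_quench = branching_fractions(1.0, 0.5, 0.5)
    fast_quench = branching_fractions(9.0, 0.5, 0.5)
    print("slow quench:", slow_quench)
    print("fast quench:", fast_quench)
```

In this toy picture, raising only the productive rate constant shifts the partition toward the desired product without touching the side-reaction rates, which is the selectivity requirement the chain-cycle analysis points to.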

Finally, having suppressed competing disubstitution,
we were able to efficiently suppress
competing oxidative cleavage simply through the
removal of molecular oxygen from the buffer
solutions in which we conducted “chemical mutagenesis”
reactions; controlled equilibration experiments
at a variety of oxygen partial pressures
(figs. S14 and S15) revealed that incubation at
<6 ppm O2 for 6 hours prior to reaction proved
generally sufficient. Application of the combined
optimized conditions led directly to clean “chemically
mutated” proteins without side reactions
that, when sequenced by MSMS, were interpreted
directly as the intended mutation (fig. S16). Both
whole-protein liquid chromatography-MS and
post-digestion MSMS (figs. S17 and S18) confirmed
that there was no nonselective alkylation
or overalkylation by the halide reagents on
other residues in protein substrates. Additionally,
Fig. 4. Use of C-C bond-forming mutagenesis to create N- and O-glycosylated
proteins. Using a common intermediate, H3-Dha®”, both forms
of GlcNAc (O-linked and N-linked) could be readily introduced and their behavior
in extension and cleavage by relevant glycan-processing enzymes tested.
(A) Despite the position in histone H3, which is not normally glycosylated, N-linked
GlcNAc was readily extended to either a disaccharide (Gal-GlcNAc,
LacNAc) or the core pentasaccharide (found in all natural N-linked glycans)
by appropriate enzymatic systems (GalT and EndoA, respectively). However,
this site proved resistant to enzymatic cleavage under conditions that led to
full cleavage in natural N-glycosylated sites (conditions, PNGase, 2 M urea,
pH 8.0). (B) As for N-linked glycosylation, the disaccharide LacNAc (Gal-GlcNAc)
and the core pentasaccharide were readily formed from O-linked GlcNAc, despite
the unnatural site. Strikingly, O-GlcNAcase from a range of sources
showed cleavage activity even at unnatural sites and notably with the human
enzyme hOGA. (C) Glycosylation of varied protein platforms, antibody cAbLys,
efflux protein AcrA, and pentapeptide repeat protein Npβ all proved possible.
Experimental data reported in bar graphs represent the normalized percentage
(mean ± SD) of cleaved substrate, taken in triplicate.


control experiments in which NaBH4 was omitted
from the reaction mixture showed that the
halide reagents alone did not react with pro- 
teins under the standard conditions. 


Biocompatible C (sp3)-C (sp3) bond-formation
enables a “toolbox” for
building natural and unnatural proteins


With optimized conditions for C-C bond for- 
mation enabling chemical mutation in hand, we 
next explored the breadth of the side chains
that could be introduced (and hence mutations 
that would be accessible) and the protein scaf- 
folds that they could be introduced into (Fig. 3). 
Representative proteinogenic and nonprotein- 
ogenic; polar, nonpolar, aromatic, ionized, and 
modified amino acid residues, bearing both nat- 
ural and unnatural motifs, were chosen and 
all readily incorporated. This demonstrated the
tolerance of the reaction to many of the most
common amino acid (and biological) functional
groups: hydroxyl (OH), aminyl (NH), guanidine, 
amide, thereby highlighting its excellent biocom- 
patibility (orthogonality). Choice of the corre- 
sponding organic iodide, typically readily available 
commercially or through chemical synthesis, al- 
lowed wide-ranging and systematic variations. 
Thus, not only could Leu be incorporated, so 
could a series of systematic variants of Leu: 
demethyl-Leu (smaller), tert-Leu (bulkier), nor-Leu
Fig. 5. Use of C-C bond-forming mutagenesis to build phosphorylated
histone proteins. Formation of natural and synthetic phosphorylated histone
H3 (H3-cpSer10 and H3-cf2pSer10) via enzymatic phosphorylation and chemical
mutagenesis, respectively. (A) Natural phosphorylation is catalyzed by
Aurora B kinase and gives rise to a mixture (inset to mass spectrum) of
phosphoforms. (B) Chemical phosphorylation via C-C bond formation gives
essentially homogeneous products H3-cpSer10 and H3-cf2pSer10. (C) Western
blot analysis with antibody to H3-pSer10 and SDS-PAGE gel stained with
Coomassie blue (for full gel, see fig. S29) shows that the phosphomimics carba-pSer
(cpSer) and difluorocarba-pSer installed by building a C-C bond are faithfully
recognized as phosphorylated by the corresponding antibody raised to bind
the natural modification (pSer). (D and E) A comparison of stability of H3-pSer10
and H3-cpSer10 under the actions of five different protein phosphatases
shows that the C-C bond in H3-cpSer10 provides full resistance even to
enzymes that fully and readily degrade the naturally phosphorylated H3-pSer10.
The y axis shows normalized percentage (mean ± SD) of phosphorylated protein
taken in triplicate. (F) Alpha-screen binding assay using phospho-reader
protein MORC3 shows binding by H3-cpSer10 and enhanced binding by H3-cf2pSer10.
Measurements were performed at least three times and were analyzed
by Student's t test at 95% confidence interval [H3-cf2pSer10 > H3-WT
in binding, P = 0.0285 (3 eq.) and P = 0.0004 (6 eq.), respectively; H3-cpSer10
> H3-WT, P = 0.0291 (6 eq.)]. Error bars indicate SEM.
Fig. 6. Use of C-C bond-forming mutagenesis to build methylated nucleosomes.
Formation of methylLys-modified (H3K9me3) or methylArg-modified (H3R26me2a)
nucleosomes by chemical mutagenesis enables insight into the
biological functions of key histone modifications. (A) Chemical
methylation via C-C bond formation allows installation of not only
K9me3 with natural isotope distribution (primarily ¹²C) but also
a ¹³C-enriched variant
algorithm to solutions of a simplified first-order model. (B) C-C bond-forming
mutagenesis also enabled the direct site-specific installation of asymmetric
dimethylarginine residues into intact histones. H3R26me2a was cleanly
installed via C-C bond formation (see supplementary materials), assembled
into nucleosomes. When used to probe human (HeLa) cell extracts for interaction
partners (three independent biological and two technical replicates),
previously unanticipated protein partners implicated i


(slimmer), and “cyclo”-Leu (conformationally
restricted) (Fig. 3). Similarly, systematic variation
of side chain length, methylene unit by
methylene unit, was also possible (e.g., Ala, ethyl-Gly,
demethyl-Leu, nor-Leu). We were also able
to strategically replace, with atomic precision,
methyl groups in residues with their labeled
or precisely altered variants: thus, CH3 → CF3
(e.g., in demethyl-Leu) or CH3 → ¹³CH3 (e.g., in
trimethyl-Lys). Current methods for isotopic labeling
based on “feeding” experiments result in
universal incorporation at every codon-determined
site; here, isotopic labels can be installed at a
single site. Such precisely fluorinated or isotopically
labeled amino acids are not only powerful
biophysical reporters (45-47), with use particularly
in protein (e.g., ¹⁹F and ¹³C) nuclear magnetic
resonance (NMR) methods, but can also
act as modulators of protein structure and binding
(48). Indeed, use of ¹⁹F NMR allowed us to
further confirm both the regioselectivity and
stereoselectivity for chemical mutations (fig. S19),


recognition of DNA damage were identified among 797 quantified proteins in 
nuclear extract, which suggests that ablation of DNA-to-H3 hydrogen bonding 
may be critically affected by such R26 methylation. Significant interacting 
protein partners (denoted by their gene name) identified upon label-free 


n Student's t test analysis are shown in red in the 
axis, logarithmized ratio of LFQ intensity difference 


among two groups; y axis, logarithmized P value from the test statistics). 


which proved to be essentially identical to that 
found on peptide models (>98% and diastereo- 
meric ratio ~1:1). 

Key posttranslational modifications also proved 
accessible, including glycosylation (2) (in O- and 
N-linked form), Lys methylation (49) (in all three 
states: mono-, di-, and tri-, as well as labeled 
tri-¹³C), Arg methylation (50) (mono- and di-),
and Gln methylation (51). Notably, no other chemical
methods exist (barring total protein synthesis)
for the installation of the majority of these
residues. Moreover, several residues that have
been previously biologically inaccessible in pro- 
teins were also readily introduced. These included 
ornithine (Orn), which by virtue of intramolecular 
cyclization chemistry cannot be loaded onto tRNAs 
and hence is incompatible with ribosomal incor- 
poration (52), and di- and trimethylated Lys that 
cannot yet be incorporated into proteins [even 
indirectly, as mono-methyl-Lys currently is (53)] 
by cellular stop-codon suppression (13). 

Finally, we surveyed the introduction of side 
chains to representative examples of protein 
functions (structural, channel, enzyme, glycoprotein)
from differing protein folds (with varying
levels of α, β, and unstructured secondary
motifs) and species types. Thus, as well as SBL 
and histone H3, we also surveyed the variously 
structured histone H4; the transmembrane bacterial
efflux component protein AcrA (54); p38α
mammalian mitogen-activated MAP kinase (55);
the mammalian antibody cAbLys3 (56); the
apoptosis marker binding protein annexinV (57);
and the pentapeptide-repeat protein Npβ (58).
These also represent proteins that are variously
associated with localization in different cellular
environments: nuclear (histones H3 and H4),
cytosolic (p38α, annexinV), transmembrane (AcrA,
Npβ), and extracellular (cAbLys3, annexinV). They
also allowed us to survey highly diverse architectures:
α helix-coiled coil (AcrA); histone
fold (mixed unfolded and α helix, histones H3
and H4); mixed α/β fold (SBL, p38α); variable-domain
immunoglobulin fold (four-strand β
sheet plus five-strand β sheet, cAbLys); α helix-rich
globular annexin fold (four × 5-α-helix domains
in a “super-helix,” annexinV); and even a
right-handed quadrilateral β helix (Npβ). Sites
of modifications sat within various feature types
including helix, sheets and loose loops (table S2).
Notably, all proteins proved compatible with 
side-chain attachment at all targeted sites (see 
supplementary materials for full characteriza- 
tion details). Multiple sites in the same pro- 
teins were also surveyed: five different sites in 
histone H3 (sites 9, 10, 26, 27, and 64) and two 
in histone H4 (sites 16 and 17). Together, these 
cumulative variations of side chains in differ- 
ent protein substrates at different sites allowed 
access to >50 “chemical mutants.” 

Analysis of sequence (e.g., by tryptic MS/MS 
peptide mapping; fig. S20), structure (e.g., sec- 
ondary structural content by circular dichroism, 
amide- and aliphatic-resonance analysis and dif- 
fusional analysis by protein ¹H-NMR; figs. S21 
to S23) and retention of function (e.g., enzymatic 
activity, antibody-binding function and biomarker 
recognition, protein complex assembly), not only 
confirmed the site selectivity of the C-C bond- 
forming chemical mutation but also suggested 
no gross alteration of global structure (see sup- 
plementary materials), although it should be 
noted that localized structural changes cannot 
be discounted. Notably, too, test experiments 
in both model systems and proteins containing 
disulfides suggested good compatibility of re- 
agents [including 2,5-dibromohexane diaceta- 
mide (DBHDA), consistent with prior results (27), 
and NaBH4; see supplementary materials and 
figs. S24 and S25] with disulfides. 


C(sp³)–C(sp³) bond-formation chemical 
mutation enables diverse techniques for 
the study of protein methylation, 
glycosylation, and phosphorylation 


With this ability to directly insert side-chain al- 
terations and hence perform chemical mutagen- 
esis on proteins with wide variation in protein 
and site, we chose to test differing proof-of- 
principle strategies that would allow insight 
into the biological function of posttranslation- 
ally modified residues (and their mimics) that 
are ordinarily difficult to install with fidelity 
into proteins by other means. To this end, we 
chose three of the most important PTMs: gly- 
cosylation, phosphorylation, and methylation. 
Glycosylation is the most diverse of the post- 
translational modifications (7, 59), yet has been 
prominently absent as a readily accessible motif 
in proteins via chemical or genetic mutagen- 
esis methods (60). Until now no general chem- 
ical method for convergently installing N- and 
O-linked glycans has been possible (59, 60). 
We used C-C bond-forming mutagenesis along 
with corresponding (entirely unprotected) N- 
and O-linked glycosidic iodides to install N- 
acetylglucosamine (GlcNAc), a glycan that is 
found naturally in both N- and O-linked form, to 
create an unnatural glycosylation site at position 
27 of histone H3 (using the same, common diver- 
gent histone H3-Dha27 protein intermediate, Fig. 4). 
Despite the unnatural constitution of this site, 
enzymatic extension with either glycosyltrans- 
ferase or endoglycosidase allowed the overall 
installation of more complex glycans onto both 
N- and O-linked GlcNAc, even up to the N-linked 
core pentasaccharide that is found in all N-linked 
glycoproteins (1, 59). We discovered that despite 
an apparent plasticity with respect to protein 
scaffold (67), PNGase (peptide-N-glycosidase), 
a widely used N-glycosidase (62), did not cleave 
synthetic variants with extended side-chain 
length (Fig. 4A and fig. S26). In striking con- 
trast, we discovered that a variety of synthetic 
O-GlcNAcylated glycoproteins were cleaved quite 
readily by O-GlcNAcases from different sources 
(Fig. 2B and fig. S27), including the human 
O-GlcNAcase (hOGA) enzyme. The latter, which 
is implicated in diabetes (63), dementia (64), and 
cancer (65), has until now been presumed selec- 
tive, given that it is the sole encoded protein 
O-GlcNAcase in the human genome (66); our 
results suggest a previously unappreciated and 
surprising plasticity. In addition to the N- and 
O-glycosylation of histone H3, we were able to 
chemically N-glycosylate and O-glycosylate other 
sites and proteins, including the naturally gly- 
cosylated protein AcrA, the pentapeptide repeat 
protein Npβ, and notably the heavy-chain antibody 
cAbLys3 with the putative Fcγ receptor ligand 
glycan Man3GlcNAc2 (Fig. 4C and fig. S28). 
Aurora B kinase, which is overexpressed in 
cancers (67), phosphorylates Ser10 of histone H3 
during mitosis (68) and is hence implicated as a 
controlling factor in cell division and proper dis- 
tribution of genetic information. The lability of 
phosphorylation and the mixtures of phosphoryl- 
ated proteins often formed from natural enzy- 
matic phosphorylation greatly complicate the 
study of phosphoryl groups on given sites, such 
as H3-pSer10 (69). We used C-C bond-forming 
mutagenesis along with corresponding (notably 
unprotected) iodophosphonates to create stable 
analogs of H3-pSer10 in which a single oxygen 
atom was replaced by methylene or difluoro- 
methylene units to create the carba-phosphoSer 
variants cpSer (70) and cf2pSer (71) (Fig. 5A); 
MS analysis revealed that this could be achieved 
with a purity that is not possible with current 
biological methods for phosphorylating histone 
H3 (Fig. 5B). Antibodies and appropriate bind- 
ing proteins [such as the “reader” protein MORC3 
(72)] recognized the resulting phosphomimic 
proteins (Fig. 5, C and F, and fig. S29), which 
also proved stable to either chemical or enzyme- 
catalyzed dephosphorylation (Fig. 5, D and E, 
and fig. S30), even with phosphatases that readily 
processed naturally phosphorylated histone H3 
(e.g., protein phosphatases 1 or 2A). It also proved 
possible to readily install carba-pSer into other 
proteins (fig. S31). Although cpSer10 proved to be 
a functioning mimic of pSer10, it has been argued 
(73, 74) that fluorophosphonates may act as 
more effective phosphate mimics by virtue of 
pKa (71), polarity, and shape (75). The C-C bond- 
forming mutagenesis allowed us to also install a 
difluorophosphonate-Ser variant (difluoro-carba- 
pSer, cf2pSer) at the same site of histone H3 to 
create H3-cf2pSer10. Consistent with improved 
mimicry, this variant showed enhanced binding 
to MORC3, validating proposed (71, 73-75) difluoro- 
phosphonate mimicry of phosphates in proteins 
(Fig. 5F). 

Protein methylation (76) is a central biological 
process (e.g., in epigenetic regulation and cell 
signal transduction). Yet the precise elucidation 
of the functional mechanistic role of methylation 
at the molecular level and the ready delineation 
of proteins associated with the “methylproteome” 
(e.g., “writers,” “readers,” and “erasers”) remains 
a grand challenge in biological science. We chose 
two methylated protein targets that have not 
been possible to create through other protein 
generation methods and created these through 
the site-selective C-C bond-forming mutagene- 
sis reaction: a site-selectively ¹³C-labeled variant 
of trimethylated Lys in histone H3 (Fig. 6A) and 
a dimethylated Arg residue site-selectively in- 
stalled into an intact nucleosomal particle 
(Fig. 6B). We created H3-[¹³C-Me]3-Lys9 (Fig. 6A) 
with precise trimethylation on Lys9, which has 
previously been observed and implicated 
in direct transcriptional regulation (77, 78). The 
structure and function of this chemical mutant 
of histone H3 were explored by protein MS and 
NMR. These revealed ready observation of the 
¹³C-labeled protein, by virtue of its isotopic la- 
bel, and the readily observable processing and 
release of the ¹³C-labeled methyl groups ([¹³C] 
Me) from ([¹³C]Me)3-Lys9 in histone H3 by the 
known demethylation enzyme JMJD2A/KDM4a 
(Fig. 6A and fig. S32A) (79, 80). 
Having installed and explored precise Lys 
methylation in an isolated histone protein, we 
explored precise Lys and Arg methylation in the 
context of an entire, intact, nucleosomal particle 
(Fig. 6, A and B); both LysMe3 and ArgMe2a 
were readily introduced. Both anti-LysMe3 anti- 
body recognition and JMJD2a/KDM4a-catalyzed 
demethylation were readily demonstrated using 
synthetically methylated nucleosome (fig. S32B). 
Arg methylation, and especially asymmetric di- 
methylation (ArgMe2a), remains only a partially 
understood alteration. Moreover, given the variant 
isomeric forms of Arg methylation and dimethy- 
lation, precise control of the installation of meth- 
ylation to create representatively methylated 
proteins has also not proved possible. Using the 
C-C chemical mutagenesis method (Fig. 6B), we 
precisely installed ArgMe2a into site 26 of his- 
tone H3 (H3R26me2a) [a site implicated (87) in 
so-called “cross-talk” epigenetic modifications 
with an order that is not clearly understood] in 
intact nucleosomal particles. 

These synthetic nucleosomal probes bearing 
asymmetric dimethylation at H3R26me2a allowed 
the identification of key partner proteins through 
affinity enrichment proteomics in human cells 
(Fig. 6B and table S1). Notably, two of the stron- 
gest interacting partners, BEND3 and BANP, 
contain BEN domains, a recently characterized 
α-helical module found in chromatin-associated 
proteins (82); BEND3 (83), a novel rDNA tran- 
scription repressor, is the most enriched inter- 
acting protein partner. Interestingly, BEND3 can 
recruit the PRC2 complex that promotes H3K27me3 
modification, a known transcription repression 
marker (84); this suggests potential cross-talk 
between these two modifications leading to gen- 
eration of a repressive chromatin state. Intrigu- 
ingly, a majority of the remaining significant 
interactors (SMARCAL1, RECQL, DDB1, DDB2, 
TOP3A) are annotated as being involved in DNA 
replication/repair. Although a link between his- 
tone arginine methylation and DNA repair has 
not been previously reported, the results of this 
experiment suggest increased accessibility of 
nucleosome-bound DNA to a range of relevant 
DNA-binding effectors; such a loss of DNA-to- 
arginine hydrogen bonding would be antici- 
pated upon methylation. Such is the flexibility 
of the C-C chemical mutagenesis method that 
it also proved possible to readily install methyl- 
ated Lys (LysMe, LysMe2, LysMe3), methylated 
Arg (ArgMe2a, ArgMe1), and methylated Gln 
into a wide range of other sites and proteins 
(fig. S33). On the basis of the ready discovery of 
previously unanticipated interacting partners 
for Arg26Me2a (see above), we anticipate that 
these too will prove to be powerfully precise 
probes of methylation function and direct “methyl- 
proteome” interactions. 

Finally, we tested the scalability of the C-C 
chemical mutagenesis method; applications 
such as translational trials of biologics and struc- 
tural biological studies can necessitate larger 
(multimilligram) amounts of proteins. Reac- 
tion of 10 mg of the Dha protein H3S10Dha 
with the relevant bromide and sodium boro- 
hydride allowed the installation of difluorome- 
thylenephosphohomoalanine (cf2pSer) in useful 
“synthetic” (isolated, 65 to 70%) yield after de- 
salting, thereby suggesting utility even for larger- 
scale synthetic applications in protein chemistry 
(fig. S34). 


Discussion 


The compatibility in the use of radicals as ef- 
fective and yet benign reactive intermediates for 
protein modification suggests that other radical- 
based methods (35) may prove powerful in the 
field of protein chemistry. Indeed, the key im- 
plicated propagating intermediate in our C-C 
bond-forming mutagenesis, the capto-datively 
stabilized Cα backbone radical, is similar to 
other stabilized radicals suggested in natural 
processes (35); not only may nature have long 
been taking advantage of such methods, but 
other routes of access to such intermediates 
could allow similarly powerful bond-forming 
strategies. This protein radical chemistry is like- 
ly to require new reagents such as boron-based 
compounds, which have played an important 
role in radical chemistry (85, 86). The asso- 
ciated mechanisms of these reagents remain a 
topic of active debate (43, 44), and mechanisms 
other than the one we propose here cannot be 
discounted. 

Given their benign application and compat- 
ibility, we can also envisage the ready combina- 
tion of the C-C mutagenesis reaction with other 
protein chemistries or assembly methods. For 
example, thioester-mediated backbone assembly 
methods (“native chemical ligation”) (15) typi- 
cally use peptide fragments with N-terminal Cys 
residues that remain at the junction point after 
“ligation.” Using the C-C mutagenesis reaction, 
these Cys residues could be readily converted to 
Dha and thence to almost any residue of choice. 
Indeed, in a proof-of-principle of such a combined 
strategy, we have been able to generate Thr- and 
Leu-containing peptides in which the Thr-Leu 
moiety is derived from thioester-mediated amide 
ligation with Cys followed by C-C chemical muta- 
genesis to Leu (see fig. S35 and supplementary 
materials). 

Because the simple choice of reagent allows 
for the variation of single-atom substituents 
(e.g., O → CH2 → CF2 in pSer, cpSer, and cf2pSer, 
as we have shown here), our approach may en- 
able a more molecular understanding of mech- 
anism of proteins and a fine-tuning of function. 
The use of stable phosphoryl mimics (such as 
cpSer or cf2pSer) suggests itself as a promising 
way to “fish” not only for phospho-binders 
(“phospho-readers,” as we have shown here) 
but for enzyme partners that would process 
their natural modification counterparts (e.g., 
phosphatases that would cleave pSer but, as we 
have shown here, cannot cleave cpSer). It should 
be noted in this context that although amber- 
codon suppression (“genetic code expansion”) 
methods have proven highly powerful in certain 
cases, many useful and biologically relevant struc- 
tures elude current approaches. For example, the 
residue types (¹³C-trimethyl-Lys, dimethyl-Arg, O- 
and N-linked glycosylated residues, difluorocarba- 
or carba-pSer) installed in the proof-of-principle 
studies (see above) have proven intractable to 
direct installation into biologically relevant sites 
by amber-codon suppression (87). Indeed, resi- 
dues bearing only small differences from their 
natural counterparts (methylated Lys, Arg, Gln), 
which we were able to incorporate chemically, 
sometimes prove difficult to incorporate by such 
genetic methods because of their strong struc- 
tural resemblance in the translational machin- 
ery to their unmodified counterparts. 

Note that although the C-C bond-forming 
mutagenesis method described here allows the 
creation of the relevant constitution (connectivity) 
found in proteins, our analysis of the stereo- 
selectivity in both peptides and proteins (see 
above) suggests approximately equal formation 
of both D- and L-configurations at the mutation 
site. This would result in yet more complex 
mixtures of protein diastereomers for multiple 
mutation sites. 
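
The growth in mixture complexity noted above can be illustrated with a short sketch. It assumes that each mutation site forms the D- and L-configurations in a 1:1 ratio (the text reports only "approximately equal" formation) and that sites behave independently, which is an assumption not tested here; the function name is hypothetical.

```python
from itertools import product


def diastereomer_fractions(n_sites, p_l=0.5):
    """Expected fraction of each D/L combination over n independent sites.

    p_l is the assumed probability of the L-configuration at each site.
    """
    fractions = {}
    for combo in product("LD", repeat=n_sites):
        frac = 1.0
        for config in combo:
            frac *= p_l if config == "L" else (1.0 - p_l)
        fractions["".join(combo)] = frac
    return fractions


if __name__ == "__main__":
    for n in (1, 2, 3):
        fr = diastereomer_fractions(n)
        # number of diastereomers and share of the all-L product
        print(n, len(fr), fr["L" * n])
```

Under these assumptions, n sites give 2^n diastereomers, and the all-L protein makes up (1/2)^n of the product: a quarter for two sites, an eighth for three.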

The improved access to relevant protein 
architectures will potentially reveal surprising 
biological functions. For instance, our data have 
revealed that the human enzyme hOGA is more 
plastic in its cleavage activity of O-GlcNAcylated 
proteins than had been previously anticipated. 
Coupled with inferred plasticity of the corre- 
sponding glycosyltransferase (OGT) that was 
recently suggested by studies on peptides (88), 
this suggests that any O-GlcNAcylation-associated 
regulatory mechanism may be much looser than 
previously realized. The elucidation of new bind- 
ing partners for nucleosomes in previously in- 
accessible methylation states (e.g., asymmetrically 
dimethylated ArgMe2a) suggests that many other 
new interactions may be discoverable via C-C 
chemical mutagenesis. These in turn will allow us 
to identify and synthetically “program” into pro- 
teins exactly those residues that engender wanted 
functional (e.g., pharmaceutical) benefit in a truly 
broad manner. For instance, the chemical glyco- 
sylation (via C-C bond mutagenesis) of an anti- 
body fragment with possible sugar ligands for 
the Fcy receptor raises the possibility of new 
cell-killing strategies (89) mediated by synthe- 
tic antibody fragments. In this way, we envi- 
sage access to synthetic proteins that will allow 
application of newly elucidated functions in, 
for example, new protein drugs or “synthetic bio- 
logics” (90). 


Materials and methods 
General Dha formation on histone 
protein substrate 


The relevant protein cysteine mutant (10 mg 
of lyophilized powder) was dissolved in 500 µL 
of reaction buffer (3 M guanidinium hydrochloride, 
100 mM sodium phosphate, pH 8.0) and 30 mg 
DTT was added. The resultant protein solution 
was shaken at room temperature for 30 min. 
Excess DTT was removed by passing the solu- 
tion through a PD MiniTrap G25 (equilibrated 
with the same buffer). Eluting with 1 mL of the 
reaction buffer gave the protein at a final con- 
centration of 10 mg/mL. To the reduced protein 
mutant was added 50 µL of a 2,5-dibromohexane 
diacetamide (DBHDA) stock solution (0.5 M, 
100 mg in 610 µL of DMF). The reaction was 
gently shaken for 45 min at room temperature 
and then 90 min at 37°C. The reaction mixture 
was concentrated to 500 µL using a Vivaspin 
500 concentrator (MWCO 10 kDa) and excess 
reagent was removed by passing the solution 
through a PD MiniTrap G25 (equilibrated with 
chemical mutagenesis reaction buffer). Elut- 
ing with 1 mL of the same buffer provided the 
dehydroalanine-containing protein. 
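
As a rough check on the quantities in this procedure, the following sketch computes the DBHDA stock concentration from the stated mass and volume, and the reagent excess over protein. The DBHDA molar mass is an assumption derived from the formula C6H10Br2N2O2, the protein molar mass is a hypothetical round value for a histone H3, and full recovery of the 10 mg of protein after desalting is also assumed.

```python
DBHDA_MW = 301.97     # g/mol; assumed from the formula C6H10Br2N2O2
PROTEIN_MW = 15300.0  # g/mol; hypothetical approximate mass of a histone H3


def molar_concentration(mass_mg, volume_ul, mw):
    """Concentration in mol/L from a mass in mg dissolved in a volume in µL."""
    moles = mass_mg * 1e-3 / mw
    litres = volume_ul * 1e-6
    return moles / litres


def reagent_equivalents(stock_molar, stock_ul, protein_mg, protein_mw):
    """Molar equivalents of reagent delivered relative to protein."""
    reagent_mol = stock_molar * stock_ul * 1e-6
    protein_mol = protein_mg * 1e-3 / protein_mw
    return reagent_mol / protein_mol


if __name__ == "__main__":
    stock = molar_concentration(100, 610, DBHDA_MW)
    equiv = reagent_equivalents(stock, 50, 10, PROTEIN_MW)
    print(f"DBHDA stock: {stock:.2f} mol/L")
    print(f"DBHDA excess over protein: {equiv:.0f} equivalents")
```

With these assumed values, 100 mg in 610 µL comes to roughly 0.54 mol/L, and the 50 µL addition to roughly 40 equivalents relative to 10 mg of protein.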


General chemical mutagenesis method 


A 2 mL eppendorf tube containing a solution 
of protein (typically 0.2-5 mg/mL, 200 µL, ~10- 
200 µM) in reaction buffer (pH 4-8, most com- 
mon buffers tolerated) was ported into an N2 
atmosphere glovebox (Belle Technologies) and 
equilibrated to anaerobic conditions (cap open) 
overnight at 4°C in a standard benchtop ep- 
pendorf cooler. In parallel, a stock solution of 
the bromide or iodide of interest (prepared by 
dissolving solids in water and adjusting the pH 
to that of the reaction buffer), or the solid re- 
agent itself, was also ported into the glovebox. 
In the case of liquid reagents, either a stock solu- 
tion was prepared as above, or the neat reagent 
was ported into the glovebox and stored at the 
appropriate temperature. Solid sodium boro- 
hydride (0.3-1 mg; smaller amounts of sodium 
borohydride can be difficult to weigh accurately) 
was ported into the glovebox in a 2 mL eppendorf 
tube and placed in the cooler kept at 4°C. Fol- 
lowing a minimum of 8 hours degassing, inside 
the glovebox, the volume of the protein solution 
was ascertained by pipette and “topped up” with 
reaction buffer to the desired volume (e.g., 200 µL). 
1-20 µL of the previously prepared iodide or 
bromide stock solution (typically 100-2000 
molar equivalents relative to protein) was then 
pipetted into the protein solution, and the re- 
sultant solution gently mixed by pipette. The en- 
tire volume (~200 µL) of the protein solution 
was then added to the 2 mL eppendorf tube kept 
at 4°C and containing solid NaBH4. The tube was 
gently shaken by hand, causing effervescence/ 
foaming to spread through the solution. Any 
discoloration from the iodide reagent should 
rapidly disappear (in some cases, particularly 
at high pH where the hydrolysis of borohydride 
is retarded, sustained effervescence is observed, 
whereas for most iodides a colorless and still 
solution is obtained after ~20 min incubation 
at 4°C). The reaction was incubated at 4°C for 
30 min (cap open) before being removed from 
the glovebox. Shaking is not necessary but may 
assist in removing evolved gases. The eppendorf 
tube was then capped and removed from the 
glovebox. Outside the glovebox, the reaction so- 
lution was loaded onto a SpinTrap G-25 desalting 
column, pre-equilibrated with the desired stor- 
age buffer. Elution according to the manufacturer's 
instructions afforded the desired chemical muta- 
genesis product, which was analyzed by LC-MS. 
For downstream applications, additional SEC 
may be necessary for sample desalting. 
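
The reagent amounts in this general method can be related to one another with a small calculation. The sketch below is illustrative only: the protein molar mass is a hypothetical value (roughly that of a histone H3), the function names are invented for this example, and the halide stock concentration it returns is simply what the stated equivalents would imply, not a value given in the procedure.

```python
NABH4_MW = 37.83      # g/mol for NaBH4
PROTEIN_MW = 15300.0  # g/mol; hypothetical, roughly a histone H3


def protein_micromolar(mg_per_ml, mw):
    """Convert a protein concentration from mg/mL to µM."""
    return mg_per_ml / mw * 1e6


def protein_nmol(mg_per_ml, volume_ul, mw):
    """Amount of protein in nmol contained in a volume given in µL."""
    return protein_micromolar(mg_per_ml, mw) * volume_ul * 1e-3


def halide_stock_molar(equiv, prot_nmol, stock_ul):
    """Stock concentration (mol/L) that delivers `equiv` equivalents in `stock_ul` µL."""
    return equiv * prot_nmol * 1e-9 / (stock_ul * 1e-6)


def nabh4_equivalents(mass_mg, prot_nmol):
    """Molar equivalents of NaBH4 relative to protein."""
    return (mass_mg * 1e-3 / NABH4_MW) / (prot_nmol * 1e-9)


if __name__ == "__main__":
    amount = protein_nmol(1.0, 200, PROTEIN_MW)
    print(f"protein: {protein_micromolar(1.0, PROTEIN_MW):.0f} µM, {amount:.1f} nmol")
    print(f"halide stock for 500 equiv in 10 µL: {halide_stock_molar(500, amount, 10):.2f} mol/L")
    print(f"0.3 mg NaBH4: {nabh4_equivalents(0.3, amount):.0f} equiv")
```

For a 1 mg/mL, 200 µL reaction with the assumed molar mass, this gives about 65 µM protein (about 13 nmol), so even the smallest weighable NaBH4 charge (0.3 mg) is a several-hundred-fold molar excess.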

INTRODUCTION: Transcription factors (TFs) 
are often studied one by one or in clusters of a 
few related factors. However, the integration 
and networks of transcriptional changes in 
response to environmental stresses often in- 
volve many related TFs. In many organisms, 
such as plants, overlapping functions can make 
it difficult to understand how a biologically re- 
levant end result can be achieved via the com- 
plex signaling networks controlled by these TFs. 
To better understand how the reference plant 
Arabidopsis deals with the stresses incurred by 
water limitation via the hormone abscisic acid 
(ABA), we characterized all DNA sequences 
that bind to the 21 ABA-related TFs in vivo. 


RATIONALE: There have been limited system- 
atic studies of stress-responsive TF networks 
in multicellular organisms. We chose ABA, an 
essential plant hormone that is required for 
both development and responses to osmotic 
stress, as an elicitor to investigate complex gene 
regulatory networks under stress. Combining 
differential binding (DB) of 21 ABA-related TFs 
at a single time point measured by chromatin 
immunoprecipitation sequencing (ChIP-seq) 
with differentially expressed genes from a time- 
series RNA sequencing (RNA-seq) data set, we 
analyzed the relationship between DB of TFs 
and differential expression (DE) of target genes, 
the determinants of DB, and the combinatorial 
effects of multi-TF binding. These data sets also 
provide a framework to construct an ABA TF 
network and to predict genes and cis-regulatory 
elements important to ABA responses and re- 
lated environmental stresses. 


[Figure panel labels: High salinity or ABA treatment; Functional validation] 

Transcriptional landscape of the ABA response. ABA response pathway gene targets were 
identified by large-scale ChIP-seq and time-series RNA-seq experiments. A network model was 
built to reveal the hierarchy of TFs and the impact of multi-TF dynamic binding on gene expression. 
A new family of transcriptional regulators was predicted by the model and was functionally 
tested to evaluate the role of these regulators in osmotic stress in plants. 


RESULTS: We found that, in general, DNA 
binding is correlated with transcript and pro- 
tein levels of TFs. Most TFs in our study are 
induced by ABA and gain binding sites (termed 
“peaks”) after the hormone treatment. ABA 
also increases the binding of the TFs at most 
peaks. However, in some peaks, TF binding may 
be static or even decrease after ABA exposure, 
revealing the complexity of gene 
regulation. De novo motif discovery enabled 
us to identify distinct, primary motifs often 
centrally localized in the ChIP-seq peaks for 
most TFs. However, it is not uncommon to find 
motifs, such as the G-box, that are shared by 
peaks from multiple TFs and may contribute 
to binding dynamics at these sites. DB of multi- 
ple TFs is a robust predictor of both the DE and 
ABA-related functions of the target genes. Using 
the DB and DE data, we constructed a network 
of TFs and canonical ABA pathway genes and 
demonstrated a regulatory hierarchy of TFs 
and extensive feedback of ABA responses. On 
the basis of a “guilt-by-association” paradigm, 
we further predicted genes whose functions 
were previously not linked to ABA responses, 
and we thus functionally characterized a new 
family of transcriptional regulators. 

Read the full article at http://dx.doi.org/10.1126/science.aag1550 


CONCLUSION: These data sets will provide 
the plant community with a roadmap of ABA- 
elicited transcriptional regulation by 21 ABA- 
related TFs. We propose that dynamic, multi-TF 
binding could be a criterion for prioritizing 
the characterization of TF binding events, cis- 
regulatory elements, and functionally unknown 
genes in both plants and other species. In our 
proof-of-principle experiments, ectopic expres- 
sion of the transcriptional regulators ranked 
highly in our model results in altered sensi- 
tivity to both ABA and high salinity. Together 
with the fact that our modeling recovered 
genes related to seed development and osmotic 
stresses, we believe such predictions are likely 
applicable to a broad range of developmental 
stages and osmotic stresses. 


The list of author affiliations is available in the full article online. 

Environmental stresses are universally encountered by microbes, plants, and animals. Yet 
systematic studies of stress-responsive transcription factor (TF) networks in multicellular 
organisms have been limited. The phytohormone abscisic acid (ABA) influences the expression of 
thousands of genes, allowing us to characterize complex stress-responsive regulatory networks. 
Using chromatin immunoprecipitation sequencing, we identified genome-wide targets of 
21 ABA-related TFs to construct a comprehensive regulatory network in Arabidopsis thaliana. 
Determinants of dynamic TF binding and a hierarchy among TFs were defined, illuminating the 
relationship between differential gene expression patterns and ABA pathway feedback regulation. 
By extrapolating regulatory characteristics of observed canonical ABA pathway components, 
we identified a new family of transcriptional regulators modulating ABA and salt responsiveness 
and demonstrated their utility to modulate plant resilience to osmotic stress. 


Transcription is a key step in gene expres- 
sion. There have been concerted efforts to 
map functional elements in human, fly, and 
worm (1-3), including a large number of 
cis-regulatory elements identified by profil- 
ing transcription factor (TF) binding using chro- 
matin immunoprecipitation sequencing (ChIP- 
seq). One area that remains largely unexplored is 
how a stimulus modulates TF binding and sub- 
sequent transcriptome changes. Furthermore, 
compared with studies in animals, very few com- 
prehensive in vivo TF binding data sets are avail- 
able for the Plantae kingdom. To begin to address 
this knowledge gap, we generated more than 100 
ChIP-seq and time-series RNA sequencing (RNA- 
seq) data sets to characterize a stimulus-influenced 
transcriptional network and map functional 
cis-regulatory elements in the reference plant 
Arabidopsis thaliana, with a focus on the phyto- 
hormone abscisic acid (ABA). The response to ABA 
provides a versatile model for the examination of 
stimulus-influenced transcriptional regulation. ABA 
triggers differential expression (DE) of thousands 
of genes and TFs, providing a robust response that 
enables modeling of complex gene regulatory 
networks. Moreover, ABA’s role in a variety of plant 
processes is important for both fundamental 
biology and agriculture (4, 5). 


¹Plant Biology Laboratory, Salk Institute for Biological 
Studies, La Jolla, CA 92037, USA. ²Computational Biology 
Department, School of Computer Science, Carnegie Mellon 
University, Pittsburgh, PA 15213, USA. ³Genomic Analysis 
Laboratory, Salk Institute for Biological Studies, La Jolla, CA 
92037, USA. ⁴Howard Hughes Medical Institute (HHMI), Salk 
Institute for Biological Studies, La Jolla, CA 92037, USA. 
*Corresponding author. Email: ecker@salk.edu 


SCIENCE sciencemag.org 


Abscisic acid plays a pivotal role in optimizing 
water use in plants and is required for both seed 
development and responses to multiple environ- 
mental stresses, such as drought and high salinity. 
In A. thaliana, ABA is recognized by the PYR/PYL/ 
RCAR receptor proteins (6-8). Binding of ABA 
triggers the interaction of PYR/PYL/RCARs with 
group A PP2C protein phosphatases and derepres- 
ses the SnRK2 protein kinases (7, 9). SnRK2s sub- 
sequently activate substrates such as TFs and elicit 
ABA responses (10, 11). Whereas many TFs are 
known to contribute to the ABA responses (12), 
little is known about their target genes and the 
way these targets are combinatorially regulated. 
In vitro approaches, including the recently de- 
scribed Arabidopsis cistrome data set, have enabled 
identification of DNA motifs and inference of the 
associated TFs (13-17). However, accurate predic- 
tions are still challenging due to many large, multi- 
member TF families. Furthermore, it is difficult 
to establish a direct link between TF binding and 
transcriptome changes or to address the dynamics 
of TF regulation through in vitro assays. Therefore, 
we applied ChIP-seq to unambiguously discover 
TF targets, examine stimulus-influenced TF bind- 
ing dynamics, and link these findings to subse- 
quent transcriptome changes. 


ChIP-seq analyses of ABA responses 


We profiled the genome-wide binding dynamics 
of a diverse collection of TFs using ChIP-seq to 
develop an in planta ABA transcriptional regula- 
tory network. We first surveyed ABA-responsive 
transcripts in A. thaliana by generating strand- 
specific RNA-seq libraries from 3-day-old etiolated 
whole seedlings treated with either 10 uM (+)-ABA 
or an ethanol-containing mock for 1, 4, 8, 12, 24, 
36, and 60 hours (Fig. 1A). Among 18,310 expres- 
sed genes, 3061 are differentially expressed [false 
discovery rate (FDR) < 0.01] (table S1) (18) for at 
least one time point. One hour of ABA treatment 
leads to moderate DE of many genes, and most 
transcriptional responses plateau after 8 hours 
(Fig. 1B and fig. S1). On the basis of the gene ex- 
pression data, we performed ChIP-seq experiments 
4 hours after dosing with ABA. We selected TFs 
based on responsiveness to ABA and published 
evidence, aiming to provide a comprehensive re- 
presentation of TF families (fig. S2 and tables S2 
and S3). In general, highly expressed and respon- 
sive TFs were chosen in each representative TF 
family because in the context of an in planta exper- 
imental framework, the impact of these TFs on 
gene expression can be more effectively investi- 
gated compared with that of their weakly expres- 
sed homologs. All TF genes were epitope-tagged 
by a recombineering-based approach (19), mostly 
with large DNA transformable artificial chromo- 
somes, allowing the TFs to be expressed under 
their native promoters and genomic context (table 
S3). The final data set consisted of 122 ChIP-seq 
experiments involving 21 TFs from 11 families, in- 
cluding mock- and ABA-treated conditions. 
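
The FDR threshold used above to call differentially expressed genes can be illustrated with a minimal sketch. It assumes a Benjamini-Hochberg adjustment of per-gene p-values; the text states only the FDR < 0.01 cutoff, not the adjustment procedure or software used, and the gene names and p-values below are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def differentially_expressed(gene_pvalues, fdr=0.01):
    """Genes whose adjusted p-value falls below the FDR cutoff."""
    genes = list(gene_pvalues)
    qvalues = benjamini_hochberg([gene_pvalues[g] for g in genes])
    return [g for g, q in zip(genes, qvalues) if q < fdr]


if __name__ == "__main__":
    # Hypothetical per-gene p-values for one time point.
    example = {"geneA": 0.0001, "geneB": 0.004, "geneC": 0.03, "geneD": 0.5}
    print(differentially_expressed(example))
```

In a time-series design such as this one, a gene would count as differentially expressed if it passes the cutoff for at least one time point, so the call would be repeated per time point and the results combined.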

Overall, the number of binding sites (“peaks”) 
varies greatly across TFs and between treatments 
(Fig. 1C). Most TFs gain binding sites across the 
genome after ABA treatment (Fig. 1C), consistent 
with the fact that these TFs are induced by ABA 
at both the transcript and protein level (Fig. 1D 
and figs. S2 and S3). As exemplified by CYP707A1 
and HAI2 (20, 21), two important genes regulating 
ABA catabolism and signaling, respectively, the 
dynamic binding of TFs elicited by ABA is often 
accompanied by altered transcript abundance of 
the target genes (Fig. 1D). Accounting for binding 
location and strength, a comparison of the genome- 
wide binding profiles of these TFs revealed that 
the TFs are generally grouped by family and known 
physical interactions (Fig. 1E) (22). The binding pro- 
files between NAM, ATAF, and CUC (NAC) and 
other TF families become more similar after ABA 
treatment (Fig. 1E, box A versus M), which sug-
gests that ABA prompts coordinated regulation
of target genes by these TFs. 
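The profile comparison summarized above rests on pairwise Pearson correlation of normalized read counts near a pooled set of peak summits (see Materials and methods). The following is a minimal sketch of that calculation only, not the authors' pipeline; the sample names and coverage values are hypothetical placeholders.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length coverage vectors.

    Assumes both vectors have non-zero variance.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def correlation_matrix(profiles):
    """Pairwise correlations between samples, as a nested dictionary."""
    names = sorted(profiles)
    return {a: {b: pearson(profiles[a], profiles[b]) for b in names}
            for a in names}

# Hypothetical normalized coverage at four summits from a union list.
profiles = {
    "TF_A_ABA": [1.0, 2.0, 3.0, 4.0],
    "TF_B_ABA": [2.0, 4.0, 6.0, 8.0],
    "TF_C_mock": [4.0, 3.0, 2.0, 1.0],
}
matrix = correlation_matrix(profiles)
```

A matrix of this kind can then be clustered to show which samples group by TF family or by treatment.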


Hormonal effects on TF binding and 
expression of target genes 


We observed substantial changes in TF binding 
at promoter regions of several known components 
of the ABA signaling pathway (Fig. 1D), so we in- 
vestigated whether dynamic binding may predict 
gene function in the ABA pathway. To quantify 
hormone-dependent, locus-specific changes of TF 
binding, we compared ChIP-seq peaks of each TF 
between ABA- and mock-treated conditions by 
performing differential binding analysis of the 
sequencing reads under the peaks (23). Three 
measures of differential binding were calculated 
for each peak: (i) normalized read count change 
(RCC) that measures absolute changes of binding, 
(ii) fold change (FC) that measures relative changes
of binding, and (iii) statistically significant differ- 
ential binding (FDR). Because there are limited 
down-regulated binding events in our data set, we
focused on up-regulated binding to determine 
the optimal cutoff of RCC, FC, and FDR to define 
the relationship between dynamic binding targets 
and genes involved in the ABA response. We ex- 
tracted three groups of A. thaliana genes (table S4) 
on the basis of gene ontology (GO) annotation (24). 
Group 1 contains 493 genes involved in the ABA 
response. Group 2 contains 1452 genes involved in 
responses to either ABA or other related processes
such as water deprivation, osmotic stress, salt stress,
cold, seed development, and stomatal movement.
Group 3 contains 999 genes involved in responses
to other hormones after excluding genes shared
with group 2. Three observations emerged from
comparing these lists to the TF target gene lists
defined by various thresholds on RCC, FC, and
FDR. First, when group 1 and 2 genes are used as
a reference set, the percentage of TF targets over-
lapping with the set increases with the number
of bound TFs (Fig. 2A, left and middle panels). By
contrast, there is very little, if any, increase when
group 3 (other hormone genes) is used as the ref-
erence (Fig. 2A, right panel). Second, an increase
of RCC and FC threshold beyond the top 20%
boosted the percentage of target genes involved
in ABA-related responses but not the percentage
of genes related to other hormones (Fig. 2A, right
panel). This improvement is even more obvious
for genes targeted by multiple TFs. Last, FDR thresh-
olds of 0.1 and 0.2 show few differences across all
analyses. These results support the premise that
dynamic binding by multiple TFs is an important
feature to specifically recover genes involved in
ABA-related responses. We thus chose the top
20% of RCCs and FCs, as well as FDR = 0.1, as the
cutoffs for follow-up analyses. As shown in Fig. 2B
and fig. S4, peaks passing all three thresholds were
designated as top-ranked up-regulated (“top up”)
or down-regulated (“top down”), whereas those
failing all thresholds were designated as static; all
remaining peaks were classified as moderately
up-regulated (“moderate up”) or down-regulated
(“moderate down”). For all tested TFs except
FBH3 and HB5, peaks tend to gain binding in-
stead of maintaining or losing binding after ABA
treatment.

Fig. 1. TF identity and hormone treatment determine genome-wide
binding profiles. (A) Growing A. thaliana in hydroponics allows convenient
buffer exchange for hormone treatment. (B) DREM-reconstructed RNA
expression paths 60 hours after exposure to ABA. Each path corresponds to
a set of genes that are coexpressed. Split nodes (green diamonds) represent
a temporal event where a group of genes coexpressed up to that point
diverges in expression, most likely due to regulatory events. Most splits are
observed up to and including the 4-hour time point, indicating that the
majority of regulatory events occur during the first 4 hours. (C) The
number of ChIP-seq peaks varies greatly between TFs and treatment con-
ditions. (D) ABA mediated differential gene expression and altered dynamics
of TF binding, as exemplified by the CYP707A1 and HAI2 genes. (E) Com-
parison of binding correlations based on normalized ChIP-seq read counts
near binding sites shows that TFs from the same family often have similar
binding profiles. TF-TF interaction (bZIPs and NF-Y, black dashed box) and
hormone treatment (RD26 and ANAC032, dotted boxes A and M for ABA-
and mock-treatment, respectively) also contribute to binding profile similarity.
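The three-criterion peak classification described in the text (top 20% of RCC, top 20% of FC, and an FDR cutoff of 0.1) can be sketched as follows. This is an illustrative reading, not the authors' code: ranking by absolute value, taking the direction from the sign of log2 FC, and the input numbers are all assumptions made for the example.

```python
def top_fraction_cutoff(values, fraction=0.2):
    """Smallest magnitude that still falls within the top `fraction` of |values|."""
    ranked = sorted((abs(v) for v in values), reverse=True)
    k = max(1, int(len(ranked) * fraction))
    return ranked[k - 1]

def classify_peaks(rcc, log2fc, fdr, fraction=0.2, fdr_cutoff=0.1):
    """Label each peak as top, moderate, or static, with an up/down direction."""
    rcc_cut = top_fraction_cutoff(rcc, fraction)
    fc_cut = top_fraction_cutoff(log2fc, fraction)
    labels = []
    for r, f, q in zip(rcc, log2fc, fdr):
        passed = [abs(r) >= rcc_cut, abs(f) >= fc_cut, q < fdr_cutoff]
        direction = "up" if f > 0 else "down"  # assumption: sign of log2 FC
        if all(passed):
            labels.append(f"top {direction}")
        elif not any(passed):
            labels.append("static")
        else:
            labels.append(f"moderate {direction}")
    return labels

# Hypothetical values for ten peaks of one TF.
rcc = [100, 5, 2, 1, -80, 3, 2, 1, 1, 1]
log2fc = [3.0, 0.1, 0.2, 0.1, -2.5, 1.0, 0.1, 0.1, 0.1, 0.1]
fdr = [0.01, 0.9, 0.8, 0.9, 0.02, 0.05, 0.9, 0.9, 0.9, 0.9]
labels = classify_peaks(rcc, log2fc, fdr)
```

In this toy input the first peak passes all three criteria with a positive fold change, the fifth passes all three with a negative fold change, and the sixth passes only the FDR criterion.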

We then explored the relationship between 
dynamic TF binding triggered by ABA treatment 
and gene expression. The Dynamic Regulatory 
Event Miner (DREM) (25) reports 11 paths of DE 
genes for the first 8 hours of ABA treatment (Fig. 
2C). As shown in fig. S5, combining DREM with
DNA motifs from PBM (protein binding micro- 
arrays), AGRIS (Arabidopsis Gene Regulatory In- 
formation Server), and DAP-seq (DNA affinity 
purification sequencing) databases recovered few
TFs in our data set, likely because of a low over-
lap of these TFs with the databases (13, 15, 26).
The DREM model identified TFs from all chro- 
matin immunoprecipitated families except for 
CCAAT-HAP3 and CCAAT-HAP5, which do not 
bind DNA as a monomer in in vitro assays (27). 
In addition, although TF binding was examined 
at a single time point, we observed a positive 
correlation between the number of dynamically 
bound TFs and the magnitude of DE across all 
time points (fig. S6), which suggests that TF 
binding data at 4 hours after dosing with ABA can 
explain a broad temporal span of gene expression. 
ABA-related GO terms such as “seed development” 
and “response to salt/osmotic stress/water dep- 
rivation” were enriched in up-regulated genes, 
whereas a few growth-related terms such as “re- 
sponse to auxin stimulus” and “cell wall organi- 
zation” were enriched in down-regulated genes 
(Fig. 2D). We observed a distinct distribution of 
dynamic binding category across DREM paths 
(Fig. 2E, fig. S1, and table S5). The extent of multi-
TF dynamic binding is associated with the mag-
nitude of differential gene expression. For example,
highly up-regulated genes are often targeted by
multiple TFs through top-up binding. Moderately 
up-regulated genes are more commonly targeted 
by multiple TFs through moderate-up binding. 
Down-regulated genes are rarely associated with 
top-up binding. Instead, these genes are predom- 
inantly associated with either static binding by 
multiple TFs or down-regulated TF binding. These 
data suggest that DE at the whole-seedling level 
is often subject to a combinatorial regulation by 
multiple TFs. As an independent validation, we 
built a regression model of DE using peak signals 
in ABA- and mock-treated conditions as features 
without hard thresholds for the level of dynamic 
binding. The resulting model reveals that multiple 
TF binding features such as ZAT6, NF-YB2, and 
ABF factors in both ABA- and mock-treated con- 
ditions contribute to DE of target genes (Fig. 2F 
and fig. S7). 


Determinants of differential TF binding 


With the discovery of tens of thousands of dif- 
ferential binding events, we were interested in 
whether we could identify features that may predict 

Fig. 2. Dynamic TF binding triggered by ABA treatment correlates with
gene function and expression. (A) Genes targeted by higher numbers of TFs 
with dynamic binding events (x axis) have higher percentages of overlap (y axis) 
with genes annotated with ABA and ABA-related GO terms, but not with GO 
terms specific to other hormones. This positive correlation is stronger for target 
genes associated with stronger dynamics (different-colored lines). (B) Hormone- 
dependent, locus-specific TF binding dynamics vary greatly across the 
genome. Log2 (fold change) of TF binding upon ABA treatment (y axis) was 
plotted against basal binding measured as log2 (normalized read counts) 
under mock treatment (x axis). Peaks were classified by three criteria: read 
count change (RCC, within top 20%), fold change (FC, within top 20%), and
DiffBind FDR (<0.1). Peaks satisfying all three criteria were designated as 
“top dynamic” (+++), and those failing all three were designated as “static” 
(- - -). The remaining peaks were designated as “moderately dynamic.” 
(C) DREM analysis shows 11 paths of DE genes after 8 hours of ABA treatment. 
(D) Each DREM path is enriched for specific GO terms. (E) The level of DE is 
correlated with multi-TF dynamic binding. (F) Ridge regression model for DE at 
4 hours, using binding strength in both ABA- and mock-treated conditions and 
including contributions from multiple TFs in both conditions. Regression co- 
efficients are plotted as relative importance of the binding features. 

binding dynamics. First, we performed motif dis-
covery by MEME-ChIP (28) to identify enriched
motifs of all 21 chromatin immunoprecipitated 
TFs from the strongest 600 peaks after either ABA 
or mock hormone treatment. A complete collection 
of the motifs is available at www.ABAtf.net. To 
investigate whether there are additional motifs 
that correlated with TF binding dynamics, we also 
performed motif discovery on both dynamic and 
static peaks for a handful of TFs. These factors 
(NF-YB2, ABF1, ABF4, FBH3, MYB3, RD26, ZAT6, 
and HB7) were selected to represent a variety of 
TF families. Pairwise comparison of primary and 
secondary motifs discovered from dynamic and 
static peaks across the selected TFs revealed three 
major clusters (Fig. 3A). Cluster 1 motifs are com-
posed of (AG)n repeats. Cluster 2 motifs contain a
(A/G)G(A/C)CC(A/C) consensus sequence, whereas 
cluster 3 comprises G-box motifs (Fig. 3B). To 
examine the contributors to binding dynamics, we 
used linear regression to model the FC of bind- 
ing as a function of variables, including basal 
binding of the TF (under mock treatment) and 
the number of occurrences (counts) of a set of non- 
redundant sequence features that capture the di- 
versity of motifs. These sequence features were 
selected from major clusters of all motifs found 
in the strongest 600 peaks (motifs of clusters A 
to I) and the dynamic and static peaks (motifs of 
clusters 1 to 3) (fig. S8). Examination of the regres-
sion coefficient P values (Fig. 3B) suggests that
the primary motifs of ABF (which also represent
cluster 3 motifs) and ANAC TFs are associated
with enhanced dynamic binding, whereas basal
binding and cluster 2 motifs are associated with
a negative effect on binding dynamics for a broad
range of TFs (Fig. 3B and table S6). Including clus- 
ter 3 or cluster 1 motifs in the regression results 
increases the explained variability by up to 20% 
(Fig. 3B). To visualize the effect of the cluster 3 
G-box motif and the cluster 2 motif at the reso- 
lution of individual peaks, we plotted basal bind- 
ing of TFs quantified by normalized read count 
against log2 FC of binding after ABA treatment 
and assigned a color to individual binding events 
on the basis of the count of motifs in the same 
peak (Fig. 3C). The proportion of peaks contain- 
ing the cluster 3 motif increases along with log2 

Fig. 3. Determinants of TF binding dynamics. (A) Hierarchical clustering of
motifs enriched in dynamic and static peaks revealed three clusters. Each entry 
in the distance matrix is -log2(P value) of motif similarity reported by Tomtom 
(28). (B) Linear regression of differential binding using basal binding and non- 
redundant sequence features identified positive and negative determinants of 
dynamic TF binding. Heatmap colors map to two-tailed t test P values on the 
regression coefficients for the null hypothesis that the coefficient is zero. The 
sequence features were selected from motifs enriched in the strongest peaks 
in ABA- and mock-treated conditions, as well as dynamic and static peaks (fig. 
S8). (C) Scatterplots show basal binding of TFs quantified by normalized read
count in the peak (x axis) against log2(FC) of TF binding after ABA treatment 
(y axis), with the color of each dot mapped to the number of indicated motifs in 
the same peak. The occurrence of cluster 3 and 2 motifs over the distributions 
of log2(FC) of binding is shown in the histograms, with the same color code as 
the scatter plot. The proportion of peaks containing the cluster 3 motif in- 
creases along with log2(FC) of TF binding for the indicated TFs, whereas the 
proportion of peaks containing the cluster 2 motif is negatively correlated with 
log2(FC) of TF binding. 




FC of binding, whereas the proportion of peaks 
containing the cluster 2 common motif is nega- 
tively correlated with log2 FC of binding. These 
data suggest that the binding of a TF to the clus- 
ter 3 motif (likely the ABFs) and the binding of 
an unknown family of TFs to the cluster 2 motif 
regulate the binding dynamics of neighboring TFs 
positively and negatively, respectively. 


Construction of an ABA 
response network 


To confirm that dynamic binding is more robust 
than total binding in predicting gene expression 
and genes involved in ABA and related responses, 
we compared the expression and functional com- 
position of genes grouped by the number of tar- 
geting TFs through either any type of binding or 
top-up binding (Fig. 4A). Representation of both 
genes associated with ABA-related GO terms and 
ABA up-regulated genes increases more rapidly 
with the increasing number of TFs that have top- 
up binding. Therefore, we decided to use top- 
ranked dynamic TF binding triggered by ABA 
treatment to demonstrate the wiring of this ABA 
network using the core ABA metabolic and sig- 
naling genes and to calculate the hierarchical 
height of TFs in the network (Fig. 4, B and C, 
and fig. S9A). TFs in the network are organized
into three tiers by their hierarchical height (Fig. 
4C and fig. S9A). The level of DE of lower-tier TFs 
is often amplified compared with that of upper- 
tier TFs, resulting in greater changes in binding 
dynamics, likely because of greater protein accu-
mulation (figs. S3, S4, and S9A). Negative regula-
tors of ABA response (including genes encoding
ABA catabolic enzymes, protein phosphatase 2Cs,
and E3 ligases) are often induced by ABA and are
heavily targeted by multiple TFs through highly 
up-regulated TF binding (Fig. 4C). By contrast, 
positive regulators of the ABA response can either 
be up-regulated due to increased TF binding or 
down-regulated due to reduced TF binding (Fig. 
4C). These results point to a transcriptional feed- 
back strategy in the ABA response, presumably 
to allow rapid restoration of normal growth once 
stress is lifted. Because some transcriptional re- 
sponses triggered by ABA are similar to those trig- 
gered by natural stresses (fig. S9, A and B) such 
as high saline conditions, we also expect to see a 
similar organization of regulatory networks for 
other osmotic-related stresses. 
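The grouping used at the start of this section, genes binned by the number of TFs that target them through top-up binding, can be sketched as below. This is a minimal illustration under stated assumptions: the binding records, the gene names, and the reference set are hypothetical, and the function names are not from the study.

```python
from collections import defaultdict

def count_targeting_tfs(bindings, category="top up"):
    """Map each gene to the number of distinct TFs binding it in `category`."""
    tfs_per_gene = defaultdict(set)
    for tf, gene, cat in bindings:
        if cat == category:
            tfs_per_gene[gene].add(tf)
    return {gene: len(tfs) for gene, tfs in tfs_per_gene.items()}

def fraction_in_reference(counts, reference, min_tfs):
    """Fraction of genes with at least `min_tfs` targeting TFs found in `reference`."""
    selected = [gene for gene, n in counts.items() if n >= min_tfs]
    if not selected:
        return 0.0
    return sum(gene in reference for gene in selected) / len(selected)

# Hypothetical (TF, gene, binding category) records.
bindings = [
    ("ABF1", "geneA", "top up"),
    ("ABF4", "geneA", "top up"),
    ("RD26", "geneA", "static"),
    ("ABF1", "geneB", "top up"),
    ("ZAT6", "geneC", "moderate up"),
]
aba_related = {"geneA"}  # hypothetical reference set, e.g. from GO annotation
counts = count_targeting_tfs(bindings)
```

Evaluating `fraction_in_reference` at increasing `min_tfs` gives the kind of trend the text describes, where representation of ABA-related genes rises with the number of top-up TFs.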

Extensive targeting by ABA-responsive TFs 
appears specific to the ABA pathway, as the core 
ABA genes are targeted by a substantially greater 
number of TFs through top-up binding than are 
genes from other plant hormones (fig. S10 and 
table S7). However, instances of hormone cross- 
talk can be observed in dynamically targeted DE 
genes. For example, both RGA-like 3 (RGL3), a 
master regulator of gibberellin response, and 
ACC synthase 2 (ACS2), an ethylene biosynthesis 
gene, were reported to be ABA-responsive (29, 30). 
We observed that dynamic binding is mainly con- 
tributed by the bZIP and NF-Y factors to the 
promoter of RGL3 and by a diverse family of 
TFs to the gene body of ACS2 (fig. S11). These 
results demonstrate the utility of these data to
pinpoint regulatory regions that might modulate
the expression of genes in one hormone response 
pathway by another. 

No GO term besides the ABA-related ones was 
enriched in DE genes heavily targeted by the 21 
TFs through top-up binding. This is partially be- 
cause more than one-third (12,136/33,601) of the 
genes in the A. thaliana genome still have no 
information regarding their biological processes 
(BPs) (Fig. 5A and table S4). On the basis of a 
“guilt-by-association” paradigm (31), we speculate
that many BP-unknown genes in Fig. 5A are also 
involved in ABA responses (table S8). As a proof 
of principle, we functionally characterized a family 
in which all members are BP-unknown and differ- 
entially expressed in response to ABA. In partic- 
ular, three members in this family (At3g48510, 
At5g50360, and At5g40790) are heavily targeted 
by TFs through top-up binding (Fig. 5, A and B). 
Little is known about this family except that the 
proper expression of At3g48510 relies on core ABA 
signaling (32). In addition, predicted proteins
of this family contain no known domains. We
generated dexamethasone (DEX)-inducible lines 
expressing green fluorescent protein (GFP) fusion 
of the two most heavily targeted genes (At3g48510 
and At5g50360). Analysis by RNA-seq showed 
that a few hundred DE genes were consistently 
identified from both short-term (4 hours) and 
long-term (10 days) DEX induction of the two genes 
(Fig. 5C and table S8). To reflect their regulation 
and function, these genes were named Dynamic 
Influencer of Gene expression 1 (DIG1) and DIG2,
and their homologs were named DIG-likes (DILs).
ABA-related GO terms, such as “response to water 
deprivation,” were enriched in DIG down-regulated 
genes (Fig. 5C). Confocal imaging further showed 
that DIGs were localized to the nucleus (Fig. 5D). 
We then tested whether the DIGs are transcription- 
al regulators. ChIP-seq of DEX-inducible GFP-DIGs 
showed that the DIGs bind chromatin. Moreover, 
stronger binding was observed in the promoter 
of DIG down-regulated genes than up-regulated 
ones or non-DE genes (Fig. 5, E and F). De novo 

Fig. 4. TF network integrates expression and connectivity features of genes in the ABA response.
(A and B) Expression and functional composition of all genes (A) and TF genes (B) are grouped by the
number of targeting TFs through either any kind of binding or top-up binding. Compared with any type of
binding, top-up binding is a better predictor for both ABA-related BP functions and DE. The number
of genes in each bin is shown in black. The bins to which the TFs included in this study belong are
indicated at the top of (B). (C) ABA pathway genes are subject to extensive feedback regulations and
multi-TF dynamic binding. Chromatin immunoprecipitated TFs are arranged in three tiers by normalized
hierarchy height. Target genes are grouped by function. Node color depicts changes of transcript abun-
dance after 4 hours of ABA treatment. Edge color corresponds to TF binding dynamic categories.

motif discovery allowed us to identify a CCAAT(n)8
ABRE motif strongly enriched near the DIG1
binding sites within 1 kb of DIG down-regulated
genes. By contrast, either a weaker motif or no
similar motif was enriched near DIG binding
sites in the corresponding regions of non-DE
genes or DIG up-regulated genes (Fig. 5, G and H).
Several ABA-responsive or developmental TFs are 
targeted by DIGs and differentially expressed 

Fig. 5. Network analysis identifies new transcriptional regulators of ABA
response. (A) Expression and functional composition of DE genes grouped by the
number of targeting TFs through top-up binding. The number of genes in each bin
is shown in black. The bins to which the DIG and DIL genes belong are indicated on
the right, with the number of targeting TFs shown in parentheses. (B) DIG and
DIL genes are regulated by multiple ABA-responsive TFs. (Left) Phylogram of
Arabidopsis DIG and DIL proteins. (Right) TFs targeting the DIG and DIL genes.
(C) DEX-induction of DIGs results in DE of stress- and water-related genes. (Top)
DE genes by DIGs after the indicated period of DEX treatment. (Bottom) Top GO
terms enriched in DIG DE genes. (D) Confocal imaging of 9-day-old DEX-treated
transgenic seedlings shows that DIG1 is nuclear-localized. (E and F) Metagene
profiles (E) and heatmaps (F) of normalized ChIP-seq read counts surrounding
DIG DE genes. Down-regulated genes are often associated with strong DIG binding
in their promoters. (G) Empirical cumulative distributions of -log10(P value) of
ChIP-seq peaks of DIG1 show that it binds more strongly to DIG down-regulated
genes than to up-regulated or non-DE genes. (H) A CCAAT(n)8 ABRE motif is
strongly enriched near DIG1 binding sites residing within 1 kb of DIG down-
regulated genes. Either a weaker motif or no similar motif is enriched in the
corresponding regions of non-DE genes or DIG up-regulated genes. (I) Induction of
DIGs results in DE of ABA- and developmental-related TFs.

upon the induction of DIGs (table S9). Among
these, ATAF1, HY5, and ABF3 have been linked 
to ABA sensitivity (33-35), whereas HY5, SCL3, 
and perhaps IAA19 have developmental roles 
(34, 36, 37). Sequence analysis revealed that DIGs 
are conserved between monocots and dicots (fig. 
S11). A remotely related clade of DIG contains the
gene Sdr4, which regulates seed dormancy in rice
(38) (fig. S12). The Sdr4 paralog in Arabidopsis
is also dynamically targeted by multiple ABA-
responsive TFs and differentially expressed in
response to ABA (table S1). However, the function-
ally important amino acid residues of Sdr4 are
not conserved in the DIGs and their homologs
(fig. S13) (38). Therefore, genes in the DIG and
Sdr4 clades may exert ABA-related functions
through distinct mechanisms. Finally, inducible
expression of DIGs enhances ABA sensitivity, as
assayed by cotyledon greening (Fig. 6, A and B) 
and lateral root growth (Fig. 6D). Similarly, en- 
hanced growth inhibition of DIG lines can also 
be observed after prolonged growth under high- 
NaCl conditions (Fig. 6, C, E, and F). Combined, 
our results suggest that DIGs are a family of tran-
scriptional regulators with broad roles that include
regulating gene expression that affects ABA sensi-
tivity and salt stress responses.

Fig. 6. DIG-inducible lines exhibit enhanced sensitivity to ABA and salt.
(A and B) ABA-dependent delay of cotyledon greening in 8-day-old seedlings
was further amplified upon DEX-mediated induction of DIG1 and DIG2 com-
pared with GFP control quantified by count of green cotyledons (A) and mea-
surement of relative chlorophyll content (B). In (B), each error bar reflects the
95% confidence interval around the mean estimate calculated from three
biological replicates of ~50 8-day-old seedlings. (C) NaCl-dependent bleaching
was observed in 4-week-old plants upon DEX-mediated induction of DIG1 and
DIG2. (D and E) DEX-mediated induction of DIG1 (D1) and DIG2 (D2) resulted in
more severe inhibition of lateral root growth than GFP (G) control plants on
ABA (D) and NaCl (E) plates. (F) DEX-mediated induction of DIG1 and DIG2
led to overaccumulation of pigments in leaves. In (D) to (F), seedlings were
transferred to the indicated plates after being grown on Linsmaier and Skoog


Conclusions


We performed a systematic study of a transcrip-
tional network by combining dynamic binding
data of 21 TFs and time series RNA-seq data in
response to a stimulus by the plant hormone ABA.
We found that dynamic TF binding measured
at a single time point correlated with the tran-
scriptome changes over a prolonged span of time.
Consistent with yeast and animals (2, 3, 39, 40),
transcription of genes in Arabidopsis is often sub-
ject to a complex regulation of multiple TFs. We
further demonstrated that dynamic binding, espe-
cially by multiple TFs, is more functionally rele-
vant than static TF binding in correlation with
differential gene expression. We speculate that
this is because an expression scheme with co-
ordinated changes in the binding dynamics of
multiple TFs would ensure robust responsiveness
of target genes to a stimulus. This observation
would have a broad application to both plants
and other species, including prioritizing studies
of (i) TF binding events and cis-regulatory ele-
ments and (ii) functionally unknown genes in a
pathway.

In plants, studies of transcriptional regulation
are often focused on master regulators (33, 41, 42).
Our data confirmed the importance of master reg-
ulators in plants. For instance, we showed that
ABFs and a physical interactor (NF-YB2) ranked
as top contributors to explain gene expression. In
addition, the primary binding motif of ABFs also
enhances the binding dynamics of many other
ABA-responsive TFs. However, more than just the
master regulators are required to attain complex
transcriptome changes to a stimulus, as many
ABA-responsive genes are dynamically targeted
by multiple TFs. Therefore, ABA response can be
viewed as orchestrated by a handful of master
regulators and facilitated by other TFs, where co-
ordinated signaling and regulatory response lead
to a rapid elicitation of transcriptome changes.

As indicated by GO annotation, network ana-
lyses of this study recovered genes affecting all
aspects of ABA-related processes, such as seed
development and response to osmotic stresses.
In planta ectopic expression of several members
of a newly discovered family of transcriptional
regulators also exhibited altered response to both
ABA and high salinity. Therefore, although we 
carried out the experiments using seedlings, the 
discoveries may be directly applicable to a broader 
range of development stages and stress scenarios. 
Emerging technologies to optimize plant water 
use have been developed based on the in-depth 
characterization of ABA perception (43). Knowl- 
edge derived from further studies of the genes 
uncovered in this study may also prove valuable 
to global agriculture, possibly enabling new strat- 
egies for plants to respond to the challenges of 
ongoing drought and groundwater depletion in 
changing environments. 


Materials and methods 


Plant materials 

Recombineering lines for the ChIP-seq experi- 
ments were generated as previously described (19) 
with minor modifications. A YPet-6xHis-3xFLAG 
tag and a 3xFLAG-YPet tag were designed for C 
terminus and N terminus fusion to the TFs of in- 
terest (table S3 and fig. S14). To abolish weak di- 
merization of YPet (44), an A206K point mutation 
was introduced by primers 5'-ATCCTTGAAGAG- 
CTTAGACTGGTAAGA-3’ and 5'-TCTTACCAGT- 
CTAAGCTCTTCAAGGAT-3’. After floral dip of 
wild-type Col-0 plants (45), T1 seeds were pooled 
and transgenic plants were selected on plates con-
taining 1x Linsmaier and Skoog (LS) pH buffered
basal salts, pH 5.7 (Caisson laboratories, UT, USA,
cat. # LSP03-1LT) with 0.7% agar and 15 µg/ml
glufosinate ammonium (Fisher Scientific, NH, 
USA, cat. # N-12111-250MG). Single-insertional 
transgenic lines were selected by chi-square test 
from T2 plants on 1x LS plates containing 15 µg/ml
glufosinate ammonium. The expression of the 
tagged TFs was confirmed by Western blotting. 
Homozygous transgenic lines were selected from 
the subsequent generation for bulking seeds. DEX- 
inducible lines for the functional characterization 
of DIGs were generated by cloning the coding se-
quence of DIGs into the p35S::LhGR-p6xOP::mGFP-
attL1-ccdB-attR1 cassette by LR recombination. After
floral dip of wild-type Col-0 plants, T1 seeds were 
pooled and transgenic plants were selected by 
hygromycin. 
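The chi-square selection of single-insertion lines mentioned above can be illustrated with a goodness-of-fit test. The sketch below assumes the usual expectation of a 3:1 resistant-to-sensitive ratio in T2 seedlings for a single dominant insertion and a 0.05 significance level; neither the ratio nor the threshold is stated in the text, and the function names are hypothetical.

```python
# Chi-square critical value for 1 degree of freedom at alpha = 0.05.
CRITICAL_0_05_DF1 = 3.841

def chi_square_3_to_1(resistant, sensitive):
    """Goodness-of-fit statistic against a 3:1 resistant:sensitive expectation."""
    total = resistant + sensitive
    expected_r, expected_s = 0.75 * total, 0.25 * total
    return ((resistant - expected_r) ** 2 / expected_r
            + (sensitive - expected_s) ** 2 / expected_s)

def consistent_with_single_insertion(resistant, sensitive):
    """True when the counts do not significantly deviate from 3:1."""
    return chi_square_3_to_1(resistant, sensitive) < CRITICAL_0_05_DF1
```

For example, 75 resistant and 25 sensitive seedlings fit the expectation exactly, whereas 90 and 10 deviate strongly, as would be expected for a line carrying more than one insertion.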


ChIP-seq experiments and analysis 


0.4 g seeds were surface sterilized by 50% bleach 
+ 0.05% Triton-X100 for 10 min. After 4 days of 
stratification at 4°C, seeds were spread on nylon 
mesh (Component Supply, FL, USA, cat. # U-CMN- 
215) in 6 hydroponics (Sigma-Aldrich, MO, USA, 
cat. # P1552) containing 1x pH buffered LS basal 
salts. After exposure under light for 4 hours to 
enhance germination, seeds were grown in dark 
at 22°C for 3 days. Etiolated seedlings were then 
switched to 1x LS buffer containing either (+)-ABA
(MP Biomedicals LLC, CA, USA, cat. # 190673)
dissolved in 100% ethanol at a final concentration
of 10 µM or ethanol alone as mock and treated
for 4 hours in dark before ChIP as previously de- 
scribed (46). Briefly, harvested seedlings were 
cross-linked by 1% formaldehyde solution (Sigma- 
Aldrich, cat. # F8775) under vacuum for 20 min. 
After nuclei isolation, chromatin was sonicated 
to 100-400 bp fragments. Tagged TFs in the trans-
genic lines were immunoprecipitated by a rabbit
polyclonal anti-GFP antibody (Thermo Fisher Scien- 
tific, MA, USA, cat. # A11122). After elution, reverse 
crosslinking and DNA purification, Illumina TruSeq 
libraries were constructed according to manufac- 
turer’s protocols. All ChIP-seq experiments in both 
ABA and ethanol mock-treatment conditions were 
done with biological replicates. Uniquely mapped 
sequencing reads to the TAIR10 genome assembly 
(Bowtie v0.12.7) (47, 48) were used to call peaks by
the IDR pipeline utilizing MACS2 (49) with mock 
IP of wild-type Col-0 ChIPped by the anti-GFP 
antibody as a control. Peaks with a P < 1x 10° 
were kept, and differential binding of TFs was
analyzed by DiffBind (v1.10.1 with edgeR v3.0.8) 
(23). To calculate TF binding similarity in Fig. 1E, 
the center of peaks (termed “summits”) of all 
ChIPped TFs were pooled together to create a 
union list. Sequencing coverage within 50 base 
pairs of summits in the union list was counted 
and normalized by deepTools (v1.5.8) (50). Pair- 
wise Pearson correlation between samples was 
used as entries in the distance matrix to plot the 
heat map in Fig. 1E. Hierarchy height of ChIPped 
TFs was calculated as previously described (40) 


$$h = \frac{O - I}{O + I}$$


where $O$ and $I$ are the out-degree and in-degree of the
examined TF through top-ranked dynamic bind-
ing. Peaks in each dynamic binding category
were associated with TAIR10 annotated genes within
1000 bp of the summit of the peaks, using the R
Bioconductor package ChIPpeakAnno (v2.12.1) (51).
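The hierarchy height of a TF is the difference between its out-degree O and in-degree I divided by their sum. The sketch below computes it from a list of directed edges; the edge list and TF names are hypothetical, and restricting the edges to top-ranked dynamic binding between TFs is left to the caller.

```python
from collections import Counter

def hierarchy_heights(edges, tfs):
    """Return h = (O - I) / (O + I) for each TF with at least one edge.

    `edges` holds (regulator, target) pairs; O counts edges leaving a TF
    and I counts edges entering it.
    """
    out_degree = Counter(src for src, _ in edges)
    in_degree = Counter(dst for _, dst in edges)
    heights = {}
    for tf in tfs:
        o, i = out_degree[tf], in_degree[tf]
        if o + i > 0:  # height is undefined for isolated TFs
            heights[tf] = (o - i) / (o + i)
    return heights

# Hypothetical three-TF network.
edges = [("TF1", "TF2"), ("TF1", "TF3"), ("TF2", "TF3")]
heights = hierarchy_heights(edges, ["TF1", "TF2", "TF3"])
```

Values near 1 mark TFs that mostly regulate others (upper tier), and values near -1 mark TFs that are mostly regulated (lower tier).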


Motif discovery and modeling the 
contribution of individual motifs to TF 
binding dynamics 


De novo motif discovery was carried out by MEME- 
ChIP (MEME 4.9.1) using a background file calcu- 
lated from TAIR10 intergenic sequences (47). Top 
five enriched motifs identified within 50 base pairs 
of the summits were filtered at an E-value cutoff of
1e-05. To model the contribution of individual
features, a set of nonredundant sequence features 
were selected to represent the overall motif diver-
sity. To do this, we first assembled a set of 135
motifs in our data set, consisting of the two most
enriched motifs in the top 600 peaks in ABA- and 
mock-treated conditions for each TF, as well as 
top five motifs enriched in dynamic and static 
peaks for all TFs. The motifs were clustered by
hierarchical clustering using motif distances
calculated with Pearson correlation coefficients
as the column comparison metric and the ungapped
Smith-Waterman alignment method (52, 53). Dynamic
tree cut of the clustering dendrogram (54)
identified 19 major clusters (color of dendrogram 
branch and the left of the annotation tracks in 
fig. S8). As several of the clusters contain similar 
motifs (for example, the G-boxes and the AG-rich 
motifs are split into multiple clusters), we selected 
11 sequence features to capture the diversity in
this set of motifs, indicated by the dark red color of
the motif name and the dark red color in the right
annotation track in fig. S8. Basal binding was measured
as log2(normalized read counts) under mock treatment,
and occurrences of motifs were assessed by
FIMO (55) at a P value cutoff of 0.0004. These
features were used in the lm() function of R
(v3.2.2) to fit log2(fold change) of the binding of
indicated TFs in Fig. 2D between ABA and mock
treatment. The relative change in explained variance
was calculated as


$(R_1^2 - R_0^2) / R_0^2$


where $R_1^2$ and $R_0^2$ are the adjusted $R^2$ from lm()
output that includes and excludes the cluster II or
cluster III motif as a feature, respectively.
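
The comparison of the two fits can be sketched as follows. This is a minimal illustration, not the authors' code: it uses the standard adjusted R² formula and assumes the change is expressed relative to the model that excludes the motif feature (the denominator is not legible in the extracted text). The function names and example numbers are hypothetical.

```python
def adjusted_r2(r2: float, n_obs: int, n_features: int) -> float:
    """Standard adjusted R^2 for a linear model with an intercept."""
    return 1.0 - (1.0 - r2) * (n_obs - 1) / (n_obs - n_features - 1)


def relative_change(adj_r2_with: float, adj_r2_without: float) -> float:
    """Relative change in explained variance when a feature is added.

    Assumption: the change is divided by the adjusted R^2 of the model
    WITHOUT the feature.
    """
    return (adj_r2_with - adj_r2_without) / adj_r2_without


# Hypothetical values: adjusted R^2 of 0.30 with the motif, 0.20 without.
change = relative_change(0.30, 0.20)
```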


RNA-seq experiments and analysis 


For ABA time series experiments, two biological 
replicates of 3-day-old etiolated, hydroponic-grown 
wild-type Col-0 seedlings were treated either by 
10 µM (+)-ABA (MP Biomedicals, LLC, cat. # 190673)
dissolved in ethanol or ethanol-only mock control 
for 1, 4, 8, 12, 24, 36, and 60 hours. For DEX treatment,
the short-term experiment was carried out by
treating 3-day-old etiolated, DEX-inducible GFP-DIG1,
GFP-DIG2, or GFP lines with 10 nM DEX
(Sigma Aldrich, cat. # D9184); the long-term experiment
was carried out by growing the same lines
in the presence of 500 nM DEX for 10 days.
Total RNA was isolated using the RNeasy Plant 
Mini Kit (Qiagen, CA, USA, cat. #74903), and cDNA 
libraries were constructed using the TruSeq Stranded
Total RNA LT Sample Prep Kit (Illumina, CA,
USA, cat. # 15032611) according to the manufacturer’s
instructions. Single-end reads were generated by
the HiSeq 2500 Sequencing System (Illumina) and 
mapped to TAIR10 genome assembly using TopHat 
2 (v2.0.8) (56). Mapped reads with mapping score 
equal to or larger than 10 were counted by HTSeq 
(v0.5.4) (57) and analyzed by edgeR (v3.6.2) (18) to 
identify differentially expressed genes using con- 
trasts between ABA- and mock-treated samples at 
each time point and false discovery rate 0.01 or 
0.05 as thresholds. 


DREM 


The Dynamic Regulatory Events Miner (DREM)
(25, 58) integrates TF-gene interactions from ChIP-seq
experiments with time series gene expression
data to identify patterns of temporal gene expression,
the associated regulators, and the dynamics
of the interactions. Splits in the reconstructed network
(green nodes in Figs. 1B and 2C and figs. S1
and S5) represent divergence of genes that are co- 
regulated up to that point and can be annotated 
by DREM with the TFs that are predicted to reg- 
ulate them, allowing us to associate the temporal 
information (the timing of the splits) with the 
interaction information either directly measured 
by ChIP-seq (fig. S1) or inferred from the AGRIS 
database (26), PBM (73), and DAP-seq (15) data 
(fig. S5). The analysis performed here used the log 
fold change of 3061 DE genes (table S1) identified 
in the ABA time series RNA-seq data. DREM paths 
were created using all DE genes without further 
filtering. 

For GO enrichment in DE genes targeted by 
categories of dynamic binding peaks (Fig. 2D), 
we defined the genes in distinct DREM paths as
foreground, and all expressed genes as back- 
ground, and retrieved the Functional Annotation 
Chart with EASE score (modified P value) thresh- 
old of 0.1 and count threshold of 2, using function- 
alities provided by the R BioConductor package 
RDAVIDWebService (59) to query the DAVID web 
service (60). The GO terms in GO_TERM_BP_FAT 
with FDR < 1% from all target gene sets are 
combined, and the enrichment P values of these 
terms are retrieved for each gene set to create the 
heatmap in Fig. 2D. If a term is not reported to 
be significant for a target set, its P value is set to 
0.1 (the P value threshold). 
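
The assembly of the heat map matrix described above can be sketched as follows. This is a minimal illustration under stated assumptions: the input is a hypothetical dictionary of per-gene-set enrichment P values (already restricted to the combined list of significant terms), and the gene set and GO term names are placeholders.

```python
P_THRESHOLD = 0.1  # fill-in value for terms not reported for a gene set


def build_pvalue_matrix(results):
    """Turn {gene_set: {go_term: p_value}} into a terms x gene_sets matrix.

    Terms that are missing for a gene set receive P_THRESHOLD.
    """
    gene_sets = sorted(results)
    terms = sorted({term for per_set in results.values() for term in per_set})
    matrix = [
        [results[gene_set].get(term, P_THRESHOLD) for gene_set in gene_sets]
        for term in terms
    ]
    return terms, gene_sets, matrix


# Hypothetical enrichment results for two DREM paths.
example = {
    "path1": {"GO:A": 0.001},
    "path2": {"GO:B": 0.01},
}
terms, gene_sets, matrix = build_pvalue_matrix(example)
```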


Modeling the contribution of individual 
TFs to gene expression 


We adopted an approach similar to previous
regression-based models that relate gene expression
to TF binding (61, 62). We first defined the TF
affinity score (TFAS), $A_{ij}$, for TF j on gene i, using
the peak closest to the TSS of the gene


$A_{ij} = g \, e^{-d/d_0}$


where g is the log2 TMM normalized read counts
of the peak and d is the distance of the peak summit
to the TSS; $d_0$ is set to 1000. For N genes and M TFs,
we constructed one N × M TFAS matrix for the
ABA treatment and one for the ethanol mock
treatment, and concatenated these two matrices
horizontally to create a final N × 2M matrix A.
We centered and scaled each column of A and fit
a log-linear model


$\log Y_i = \sum_{j=1}^{2M} \beta_j A_{ij} + \epsilon$


where $Y_i$ is the fold change in expression of gene i
at 4 hours ABA treatment compared to mock. We
limited the model training and testing to genes
that are differentially expressed at 4 hours with
FDR < 0.01 and those that are not differentially
expressed at any time point (FDR > 0.7). A glmnet
regression model (63) was trained on 75% of the
genes by five repeats of 10-fold cross-validation
using the caret package in R (64) with the tuning
metric set to RMSE and the elastic net mixing
parameter α = 0 to allow selection of correlated TFAS
features. The “best” rule was used to choose a value
for the tuning parameter (in this case, the regularization
parameter λ), i.e., a value that minimized
the average RMSE of the regression on the 50
resamplings of the training set. The glmnet model
was then fitted using the chosen λ value to arrive
at the regression coefficients in the final model.
The unscaled coefficients of the TFAS features are 
plotted as binding feature importance in Fig. 2F. 
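
The construction of the feature matrix can be sketched as follows. This is a minimal illustration, assuming the affinity score takes the form A = g · exp(-d/d0) with d0 = 1000 as stated in the text; the (g, d) values, the single-TF layout, and the helper names are hypothetical. The regularized regression itself (glmnet with α = 0, which corresponds to ridge regression) is not reproduced here.

```python
import math

D0 = 1000.0  # distance scale from the text


def tfas(log2_counts: float, distance: float, d0: float = D0) -> float:
    """TF affinity score: peak signal down-weighted by distance to the TSS."""
    return log2_counts * math.exp(-distance / d0)


def standardize_columns(matrix):
    """Center each column to mean 0 and scale to unit sample SD."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    scaled = [[0.0] * n_cols for _ in range(n_rows)]
    for j in range(n_cols):
        column = [row[j] for row in matrix]
        mean = sum(column) / n_rows
        sd = math.sqrt(sum((x - mean) ** 2 for x in column) / (n_rows - 1))
        for i in range(n_rows):
            scaled[i][j] = (column[i] - mean) / sd if sd > 0 else 0.0
    return scaled


# Hypothetical data: 3 genes, 1 TF, under ABA and mock treatment.
aba = [[tfas(8.0, 0.0)], [tfas(6.0, 1000.0)], [tfas(4.0, 2000.0)]]
mock = [[tfas(5.0, 0.0)], [tfas(5.0, 1000.0)], [tfas(5.0, 2000.0)]]

# Horizontal concatenation gives the N x 2M matrix A.
A = [aba_row + mock_row for aba_row, mock_row in zip(aba, mock)]
A_scaled = standardize_columns(A)
```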


Confocal imaging 


Nine-day-old DEX-inducible GFP and GFP-DIG1
seedlings grown on 1x LS plates containing 200 nM
DEX and 300 nM ABA were imaged with a Zeiss
710 confocal microscope using an argon laser at
488 nm. GFP signal was captured within the 493- to
548-nm emission window and was pseudo-colored
in green. Autofluorescence from chloroplasts was
captured within the 569- to 695-nm emission window
and was pseudo-colored in red.


Sequence analysis of DIGs and 
their homologs 


The protein sequence of DIG1 was used as a query
to search for homologous protein sequences in
A. thaliana by the BLASTP search tool on
Ensembl Plants (65). The resulting six protein
sequences (Q9FK36, Q9SMP6, Q9FGW7, Q9LK28,
Q9FKS6, Q9FKS7) were used to query A. thaliana,
Glycine max, Solanum lycopersicum, Oryza sativa
japonica, and Zea mays by BLASTP, resulting in
21 homologous sequences. These sequences were
aligned by MEGA6 (66) using a distance-based maximum
likelihood method, and bootstrap values
were generated from 1000 replications.


Chlorophyll measurement 


Chlorophyll content was determined as previously 
described (67). Briefly, each sample, consisting of
~50 seeds, was germinated and grown on LS
plates supplemented with or without ABA and
DEX for eight days. The seedlings were collected
and ground in liquid nitrogen. Chlorophyll was
extracted with 80% acetone until pellets were almost
white. Absorbance was measured at 647
and 664 nm in a DU-730 spectrophotometer 
(Beckman Coulter, CA, USA). Chlorophyll content 
was determined as 


$\mathrm{Chl}\ a + b = 17.76 \times A_{647} + 7.34 \times A_{664}$


Chlorophyll content of each transgenic line was 
then normalized by the corresponding seedlings 
grown on LS plates containing no DEX or ABA. 
The 95% confidence interval around the mean 
estimate was calculated from three biological 
replicates. 
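
The chlorophyll calculation and normalization can be sketched as follows. This is a minimal illustration using the coefficients given in the text; the absorbance readings and function names are hypothetical.

```python
def chlorophyll_a_plus_b(a647: float, a664: float) -> float:
    """Total chlorophyll (a + b) from absorbance at 647 and 664 nm."""
    return 17.76 * a647 + 7.34 * a664


def normalized_content(treated: float, untreated: float) -> float:
    """Content of a line relative to the same line grown without DEX or ABA."""
    return treated / untreated


# Hypothetical absorbance readings for one line.
treated = chlorophyll_a_plus_b(0.25, 0.5)
untreated = chlorophyll_a_plus_b(0.5, 1.0)
relative = normalized_content(treated, untreated)
```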

INTRODUCTION: Genetic suppression occurs
when the phenotypic defects caused by
a mutated gene are rescued by a mutation in 
another gene. 

These genetic interactions can connect 
genes that work within the same pathway 
or biological process, providing new 
mechanistic insights into cellular 
function, or they can correct defects 
in gene expression or protein pro- 
duction. More generally, suppression 
interactions may play an important 
role in the genetics underlying hu- 
man diseases, such as the diverse penetrance 
of Mendelian disease variants. Our ability to 
interpret personal genome sequences remains 
limited, in part, because we lack an under- 
standing of how sequence variants interact in 
nonadditive ways to generate profound phe- 
notypes, including genetic suppression. 


RATIONALE: Genetic interactions, in which 
mutations in two different genes combine to 
generate an unexpected phenotype, may un- 
derlie a significant component of trait her- 
itability. Although genetic interactions that 
compromise fitness, such as synthetic lethality, 
have been mapped extensively, suppression 
interactions have not been explored system- 
atically. To understand the general principles 
of genetic suppression and to examine the 
extent to which these interactions reflect 
cellular function, we harnessed the powerful 
genetics of the budding yeast Saccharomyces 
cerevisiae to assemble a global network of gen- 
etic suppression interactions. 


RESULTS: By analyzing hundreds of pub- 
lished papers, we assembled a network of 
genetic suppression interactions involving 
~1300 different yeast genes and ~1800 unique 
interactions. Through automated genetic mapping
and whole-genome sequencing, we also
isolated an unbiased, experimental set of ~200 
spontaneous suppressor mutations that cor- 
rect the fitness defects of deletion or hypomor- 
phic mutant alleles. Integrating these results 
yielded a global suppression network. 

The majority of suppression in- 
teractions identified novel gene-gene 
connections, thus providing new in- 
formation about the functional wiring 
diagram of a cell. Most suppression 
pairs connected functionally related 
genes, including genes encoding mem- 
bers of the same pathway or complex. The 
functional enrichments observed for suppres- 
sion gene pairs were several times as high as 
those found for other types of genetic in- 
teractions; this highlighted their discovery 
potential for assigning gene function. Our 
systematic suppression analysis also identi- 
fied a prevalent allele-specific mechanism of 
suppression, whereby growth defects of hypo- 
morphic alleles can be overcome by mutations 
that compromise either protein or mRNA 
degradation machineries. 

From whole-genome sequencing of suppres- 
sor strains, we also identified additional sec- 
ondary mutations, the vast majority of which 
appeared to be random passenger mutations. 
However, a small subset of genes was enriched 
for secondary mutations, several of which 
did not affect growth rate but rather appeared 
to delay the onset of the stationary phase. This 
delay suggests that they are selected for under 
laboratory growth conditions because they 
increase cell abundance within a propagat- 
ing population. 


CONCLUSION: A global network of genetic 
suppression interactions highlights the major 
potential for systematic studies of suppression 
to map cellular function. Our findings allowed
us to formulate and quantify the general
mechanisms of genetic suppression, which 
has the potential to guide the identification 
of modifier genes affecting the penetrance of 
genetic traits, including human disease. 

Global analysis of genetic suppression. Genetic
suppression interactions occur when the
detrimental effects of a primary mutation can 
be overcome by a secondary mutation. Both 
literature-curated and experimentally derived sup- 
pression interactions were collected and yielded 
a genetic suppression network. This global network 
was enriched for functional relationships and de- 
fined distinct mechanistic classes of genetic 
suppression. 

Genetic suppression occurs when the phenotypic defects caused by a mutation in a particular
gene are rescued by a mutation in a second gene. To explore the principles of genetic 
suppression, we examined both literature-curated and unbiased experimental data, involving 
systematic genetic mapping and whole-genome sequencing, to generate a large-scale 
suppression network among yeast genes. Most suppression pairs identified novel relationships 
among functionally related genes, providing new insights into the functional wiring diagram
of the cell. In addition to suppressor mutations, we identified frequent secondary mutations,
in a subset of genes, that likely cause a delay in the onset of stationary phase, which appears 
to promote their enrichment within a propagating population. These findings allow us to 
formulate and quantify general mechanisms of genetic suppression. 


Although causative variants have been identified
for many Mendelian disorders, challenges
remain in understanding how genetic
variants combine to generate phenotypes.
Great progress has been made in mapping 
and interpreting genetic interactions in yeast, by 
using growth rate as a proxy for fitness. High- 
throughput genetic interaction studies have iden- 
tified hundreds of thousands of negative and 
positive interactions, in which the fitness defect 
of a yeast double mutant is either more or less 
severe, respectively, than the expected effect of 
combining the single mutants (Fig. 1A) (1, 2).
Positive interactions indicate that the phenotypic 
effects associated with detrimental mutations can 
be masked or overcome and may explain why 
certain individuals are healthy despite carrying 
severe disease-causing mutations (3). 

Positive interactions can be further classified 
by their relative strength, ranging from masking, 
in which the double mutant fitness is higher than 
expected but less than or equal to that of the slowest
growing single mutant, to suppression, in
which the double mutant is healthier than the
slowest growing single mutant and possibly has
a fitness that is comparable to that of wild type
(Fig. 1A) (1, 4). These classes of positive interactions
can represent biologically distinct functional rela- 
tionships (4, 5). Most positive interactions iden- 
tified by systematic genetic interaction screens 
in yeast, based on synthetic genetic array (SGA) 
analysis with loss-of-function mutations (2, 6), 
are relatively weak masking interactions (fig. S1A), 
such as the positive interactions that occur among 
genes within the same nonessential complex or 
pathway (7). By contrast, stronger suppression 
interactions remain largely unexplored. 
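
The interaction classes described above can be sketched as a simple decision rule. This is a minimal illustration, assuming a multiplicative expectation for the double mutant (as in the Fig. 1A legend); the tolerance `tol` around the expected fitness is a hypothetical parameter, not a threshold used in the study.

```python
def classify_interaction(f_a: float, f_b: float, f_ab: float,
                         tol: float = 0.05) -> str:
    """Classify a double mutant from single (f_a, f_b) and double (f_ab) fitness."""
    expected = f_a * f_b      # multiplicative model
    slowest = min(f_a, f_b)   # slowest growing single mutant
    if f_ab < expected - tol:
        return "negative"
    if f_ab <= expected + tol:
        return "no interaction"
    if f_ab <= slowest:
        return "masking"
    return "suppression"


# Single mutants with fitness 0.8 and 0.7 (expected double mutant: 0.56).
examples = {
    0.40: classify_interaction(0.8, 0.7, 0.40),
    0.65: classify_interaction(0.8, 0.7, 0.65),
    0.90: classify_interaction(0.8, 0.7, 0.90),
}
```

In this sketch, a double mutant that is exactly as fit as the slowest single mutant counts as masking, matching the "less than or equal to" wording in the text.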
Spontaneous suppressor mutations can be se- 
lected to overcome the fitness defect associated 
with a specific mutant allele. Extragenic suppres- 
sor mutations encompass two basic classes: (i) in- 
formational suppressors that change the protein 
translational or mRNA transcriptional machinery, 
such that the primary mutation is reinterpreted,
and (ii) functional suppressors in which a mutation
in a second gene functionally compensates
for the defect associated with the primary muta- 
tion (8). Here, our major goal was to investigate 
the general principles of functional suppression 
by assembling a global network of these inter- 
actions, which should provide new mechanistic 
insights about protein function and enable the 
ordering of components of biological pathways. 


A network of literature-curated 
suppression interactions 


To capture existing suppression interactions in 
Saccharomyces cerevisiae, we examined ~6000 
potential interactions in ~1700 published papers 
derived from BioGRID’s “synthetic rescue”
data set (9). For each interaction, we annotated
the type of suppressor mutation (e.g., spontane- 
ous mutation or deletion allele); the type of mu- 
tation that is being suppressed, which we refer to 
as a “query” mutation; and the use of specific 
conditions (e.g., a drug or specialized carbon 
source). Suppression interactions that were in- 
tragenic, involved a specific phenotype other 
than growth, or included more than two genes 
were excluded from the final data set. We also 
removed suppression interactions derived from 
high-throughput experiments or dosage inter- 
actions in which either the query or the suppressor 
was overexpressed. The resulting literature-curated 
network encompassed 1304 genes and 1842 unique 
suppression interactions (table S1). We visual- 
ized this network using a force-directed layout 
(10), so that query genes that share a common 
suppressor tend to be positioned together (Fig. 
1B). Most query genes (69%) are suppressed by 
one or two suppressor genes, whereas a small 
subset of queries (5%) have numerous (10 to 27) 
reported interactions (fig. S1B). Despite the rela- 
tively low average network degree, genes in- 
volved in highly studied processes, such as DNA 
replication and repair or chromatin and trans- 
cription, tend to group together because of their 
shared suppression interactions (Fig. 1B). 
Combining data from multiple studies can 
reveal suppression mechanisms between path- 
ways or protein complexes that may not be ap- 
parent from any individual study. Indeed, a 
subnetwork focused on DNA replication and 
repair pathways showed that many of the inter- 
actions appear to represent the activation of al- 
ternative DNA repair pathways (Fig. 1C). For 
example, mutations that perturb Rad51-dependent 
homologous recombination (HR) often lead to 
toxic chromosomal deletions or rearrangements 
due to increased repair of double-strand DNA 
breaks by nonhomologous end joining (NHEJ) 
(11). In this case, suppression can occur through

Fig. 1. A global network of literature-curated suppression interactions for
S. cerevisiae. (A) Genetic interaction classes. When two single mutants (xxxΔ
and yyyΔ) have a relative fitness of 0.8 and 0.7, the expected fitness of the resultant
double mutant (xxxΔ yyyΔ) based on a multiplicative model is 0.8 × 0.7 = 0.56. A
negative genetic interaction occurs when the observed double mutant fitness is 
lower than this expected fitness. A masking positive interaction occurs when the 
fitness of the double mutant is greater than expected, but lower or equal to that 
of the slowest growing single mutant. Suppression positive interactions occur 
when the double mutant fitness is greater than that of the slowest growing 
single mutant. (B) A global network of literature-curated suppression interactions
for S. cerevisiae. Genes are represented as nodes and interactions as edges. The 
nodes were distributed using a force-directed layout, such that genes that share a 
suppressor tend to be close together on the network. Genes involved in chromatin 
and transcription or DNA replication and repair are highlighted in magenta and 
cyan, respectively. (C and D) Regions of the global network highlighting sup- 
pression interactions between complexes and pathways involved in chromatin 
and transcription (C) or DNA replication and repair (D) are shown. Arrows point 
from the suppressor to the query. PCNA, proliferating cell nuclear antigen. 

NHEJ inactivation, which favors double-strand
break repair by the compromised, but more accurate,
HR machinery (12). Similar trends are observed
for genes involved in transcription, for
which suppression interactions between path- 
ways mainly represent activation or repression 
of transcription (Fig. 1D). For example, mutations 
in genes encoding Mediator or RNA polymerase 
II subunits can reduce transcription efficiency, 
which can suppress the toxic effects of derepressed 
transcription caused by loss-of-function muta- 
tions in the NC2 transcription regulator complex 
(12). Thus, by integrating data from hundreds of 
papers, we derived a suppression network that 
provides insight on general suppression relation- 
ships and the ordering of pathways and com- 
plexes within a biological process. 


Suppression interactions within and 
across cellular processes 


Consistent with other biological networks (2, 13-15), 
many suppression interactions occurred between 
functionally related genes, such that a query mu- 
tant tended to be suppressed by another gene an- 
notated to the same biological process (Fig. 2A). 
Genes connected by suppression interactions also 
tended to be coexpressed and encode proteins 
that function in the same subcellular compart- 
ment and/or belong to the same pathway or 
protein complex (Fig. 2B). The extent of func- 
tional relatedness between suppression gene pairs 
did not depend on the conditions under which 
the interaction was identified (e.g., a specific 
drug or carbon source), or whether the suppres- 
sor was isolated as a spontaneous suppressor 
mutation as opposed to an engineered allele that 
was directly tested for an interaction (fig. S2A). 
However, the frequency of shared complex mem- 
bership was significantly higher for gene pairs 
in which the suppressor gene carried a gain-of- 
function mutation compared with gene pairs in- 
volving loss-of-function suppressor mutations (P = 
0.01, Fisher’s exact test). Thus, when a query muta- 
tion perturbs a subunit of a complex, compensat- 
ing mutations in another subunit can be gain of 
function—for example, by stabilizing the complex. 
Notably, the functional enrichment observed 
in the genetic suppression network was substan- 
tially stronger than in a global network of nega- 
tive and positive genetic interactions generated 
with SGA (6) (Fig. 2B). In fact, most positive 
genetic interactions identified in the global SGA 
network, especially among loss-of-function alleles 
of essential genes, do not overlap with other func- 
tional interaction data. Suppression interactions 
thus constitute a special class of positive genetic 
interaction that captures highly specific function- 
al relationships between gene pairs (fig. S2B). 
Despite their tendency to connect functionally 
related genes, suppression interactions also con- 
nect different biological processes. These inter- 
actions often occurred between genes involved 
in related processes, such as Golgi, endosome, 
or vacuole sorting and ER-Golgi traffic (Fig. 2A). 
Note that genes involved in protein degradation 
suppress growth defects associated with mutation 
of genes involved in many different biological 

Fig. 2. Properties of the suppression network. (A) Frequency of suppression interactions connecting
genes within and across indicated biological processes. Node size reflects fold enrichment for interacting 
gene pairs observed for a given pair of biological processes. Significance of the enrichment was determined 
by Fisher's exact test, comparing the observed frequency of suppression interactions between two given 
functional categories with the global frequency. The total number of suppression interactions involving genes 
annotated to a particular process is indicated. Kinet., kinetochore. (B and C) Fold enrichment for (B) col- 
ocalization, GO coannotation, coexpression, same pathway membership, and same complex membership 
for gene pairs involved in different types of genetic interaction (GI); and (C) overlap of literature-curated 
suppression interactions with dosage suppression interactions (13), or with negative and positive genetic 
interactions identified by SGA analysis using either an intermediate or a stringent interaction score threshold 
(6). A Fisher's exact test was performed to determine statistical significance of the results. (D) An example of 
a gene pair showing suppression, dosage suppression, and negative genetic interactions. 

processes. This central role for protein turnover
in the suppression network likely reflects a more 
general mechanism whereby growth defects of 
conditional temperature-sensitive (TS) alleles of 
essential query genes, which are often hypomor- 
phic (partially functional) even at a permissive 
temperature, can be overcome by additional mu- 
tations that weaken the protein degradation ma- 
chinery and elevate protein levels. 


Overlap with other genetic networks 


The suppression network shows significant over- 
lap with a dosage suppression network (13) (P = 2 x 
107°, Fisher’s exact test) and with SGA-derived 
positive and negative genetic interaction networks 
(2, 6) (P =5 x 10*” and P = 1x 10°”, respectively, 
Fisher’s exact test). The overlap with positive gen- 
etic interactions (fivefold enrichment) (Fig. 2C) 
is expected, as suppression interactions are an 
extreme type of positive interaction. Indeed, this 
overlap increases (11-fold enrichment) for stron- 
ger positive genetic interactions. The overlap of 
the suppression network with dosage-suppression 
interactions associated with gene overexpression 
reflects that overexpression may lead to a gain- 
of-function phenotype (16) and suppression can 
involve gain-of-function alleles (Fig. 2C and fig. 
S2C). Gain-of-function suppressor mutations also 
explain the 2.5-fold enrichment for negative genetic 
interactions between loss-of-function alleles (Fig. 
2C and fig. S2C). For example, whereas the growth 
defect associated with loss-of-function mutations 
in CDC25, which encodes the guanine nucleotide 
exchange factor that activates Ras2, can be sup- 
pressed by gain-of-function mutations in RAS2, loss- 
of-function mutations in RAS2 exacerbate the cdc25 
growth defect, thereby causing a synthetic lethal 
negative genetic interaction (Fig. 2D). Despite 
the overlap with other genetic networks, most sup- 
pression interactions (78%) are specific to the sup- 
pression network and thus provide novel insights 
into the functional wiring diagram of a cell. 
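
The overlap statistics reported in this section can be illustrated with a small sketch of Fisher's exact test on a 2 × 2 table of gene pairs. This is a minimal, assumption-laden example: it computes a one-sided (enrichment) P value from the hypergeometric distribution, whereas the sidedness used in the study is not stated, and the counts are hypothetical.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test for the table [[a, b], [c, d]].

    a: pairs in both networks, b: pairs only in network 1,
    c: pairs only in network 2, d: pairs in neither.
    Returns P(X >= a) under the hypergeometric null.
    """
    row1, col1, total = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    numerator = sum(
        comb(row1, k) * comb(total - row1, col1 - k)
        for k in range(a, upper + 1)
    )
    return numerator / comb(total, col1)


def fold_enrichment(a: int, b: int, c: int, d: int) -> float:
    """Overlap frequency within network 1 relative to the background frequency."""
    observed = a / (a + b)
    background = (a + c) / (a + b + c + d)
    return observed / background


# Hypothetical counts of gene pairs.
p_value = fisher_exact_greater(3, 1, 1, 3)
enrichment = fold_enrichment(3, 1, 1, 3)
```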


Systematic identification of spontaneous 
suppressor mutations 


Literature-curated data can come from specific 
hypothesis-driven experiments and may thus 
be biased (15, 17). We therefore compared the 
curated suppression network to an independent 
experimental set of spontaneous suppressor 
mutations identified through the large-scale ap- 
plication of SGA analysis. In SGA, a specific natMX- 
marked query mutation is crossed to an array 
of ~5000 kanMX-marked deletion mutants, to 
systematically construct a complete set of hap- 
loid natMX- and kanMX-marked double mutants 
(18, 19). This also represents a genome-wide set 
of two-factor crosses, enabling us to scan the 
query strain genome for the presence of an un- 
marked extragenic suppressor locus, which SGA 
analysis reveals as a collinear set of small col- 
onies spanning the genomic location of the sup- 
pressor mutation, which we refer to as a linkage 
group (20, 21) (fig. S3A). In total, we completed 
7056 full-genome SGA screens, involving mutant 
strains carrying deletion or hypomorphic alleles 
of 5845 different genes (2, 6). In 251 SGA screens 
(~4%), we identified a linkage group that suggested
the presence of a spontaneous extragenic
suppressor mutation (tables S2 and S3).

The 251 candidate suppressor strains were an- 
alyzed by whole-genome sequencing, and for 216 
(86%) of these, a mutation was discovered within 
the suppressor locus identified by SGA (fig. S3A 
and table S2). Almost all (98%) of these mutations
were subsequently confirmed by Sanger sequencing
(table S2). For 24 genes, multiple independently
generated query strains carried a potential
extragenic suppressor mutation (table S2). In 13 
(54%) of these 24 cases, the extragenic suppressor 
mutations were in the same gene, whereas in the 
remaining 11 cases, two different suppressor genes 
were identified. In three instances, these different 
suppressor genes encoded known members of the 
same complex. 

We next validated candidate suppressor genes 
using several genetic tests, including plasmid- 
based complementation assays and tetrad anal- 
ysis of meiotic progeny derived from crossing 
each suppressor strain to a wild-type strain, a 
strain with a marked deletion that was genet- 
ically linked to the candidate suppressor, or a 
strain carrying a deletion or hypomorphic allele 
of the suppressor gene (fig. S3A) (21). Of the
suppressor interactions, 88% gave a positive re- 
sult in at least one assay (table S2). Based on these 
assays and the type of mutation, one-third (33%) 
of the suppressor mutations appeared to be gain- 
of-function, and two-thirds (67%) appeared to be 
loss-of-function mutations. We also randomly 
selected four potential loss-of-function and five 
potential gain-of-function suppressor alleles and 
introduced those into a diploid strain that was 
heterozygous for the corresponding query muta- 
tion. In all cases, sporulation and tetrad analysis 
of the resulting diploids confirmed the genetic 
interaction and identity of the suppressor muta- 
tion (table S2 and fig. S3A). Thus, we identified 
216 unbiased mutations that arose spontaneously 
to suppress severe growth defects associated with 
146 deletion mutants of nonessential genes and 
70 hypomorphic alleles of essential genes (table S2). 

Although we observed significant overlap with 
the literature-curated data set (15 shared inter- 
actions, P = 1 x 10-9, Fisher’s exact test), most of 
the spontaneous suppression interactions iden- 
tified through SGA (92%) have not been reported 
previously; this indicates that the yeast genetic 
suppression network has remained largely un- 
explored. The experimentally derived suppression 
interactions showed similar significant enrich- 
ments as the literature-curated set for different 
types of genetic interactions, as well as for func- 
tionally related gene pairs, suggesting that sup- 
pression interactions in both networks define close 
functional relationships between genes and share 
the same basic properties (fig. S3, B and C). 


Suppression interaction magnitude 
correlates with functional relatedness 


Given that suppression interactions tend to con- 
nect functionally related genes, we examined wheth- 
er the relative magnitude of a given suppression 
interaction was indicative of the extent of functional
overlap. We estimated the relative magnitude
of suppression for our systematic interactions
(table S4) (21), ranked the suppression pairs by 
suppression magnitude, and calculated the frac- 
tion of functionally related pairs for the 33% 
strongest and weakest suppression interactions 
(fig. S4). Gene pairs exhibiting more severe sup- 
pression interactions showed stronger enrichments 
for various measures of functional relatedness 
(fig. S4), in line with what has been described for 
positive and negative genetic interactions (2). 
Thus, large improvements in fitness appear to 
be caused by mutations in genes that are func- 
tionally similar to the query, whereas weaker 
suppression may be achieved by more general or 
diverse mechanisms. 
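
The comparison of the strongest and weakest suppression interactions can be sketched as follows. This is a minimal illustration: pairs are ranked by suppression magnitude, and the fraction of functionally related pairs is computed for the top and bottom thirds. The input values and the boolean "related" annotation are hypothetical.

```python
def fraction_related_by_tercile(pairs):
    """pairs: list of (suppression_magnitude, is_functionally_related).

    Returns (fraction related among the strongest third,
             fraction related among the weakest third).
    """
    ranked = sorted(pairs, key=lambda pair: pair[0], reverse=True)
    k = len(ranked) // 3
    if k == 0:
        raise ValueError("need at least three pairs")

    def fraction(group):
        return sum(1 for _, related in group if related) / len(group)

    return fraction(ranked[:k]), fraction(ranked[-k:])


# Hypothetical suppression pairs.
pairs = [(0.9, True), (0.8, True), (0.5, False),
         (0.4, True), (0.2, False), (0.1, False)]
strongest, weakest = fraction_related_by_tercile(pairs)
```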


Systematic analysis identifies 
suppressor hubs 


The literature-curated network is enriched for 
genes involved in highly studied processes, such 
as chromatin and transcription, as well as DNA 
replication and repair (Fig. 3A). In contrast, in 
the experimentally derived network, queries and 
suppressors were more evenly spread over the 
various biological processes. As we found for 
the literature network (Fig. 2), genes involved 
in protein degradation were specifically over- 
represented as suppressors in the systematic study 
(Fig. 3A), which mainly reflects suppression of 
point-mutation alleles of essential queries. Although
no significant functional enrichment was found for
genes involved in RNA processing, nonsense-mediated
mRNA decay genes suppressed several DAmP alleles of
essential genes (generated by decreased abundance by
mRNA perturbation, DAmP) (22), which affect mRNA
stability through disruption of their 3′ untranslated
region. Thus, restoring protein or mRNA levels 
may represent a widespread mechanism to over- 
come growth defects caused by hypomorphic 
alleles. 

It is noteworthy that suppressed queries with 
roles in ribosome biogenesis and translation were 
underrepresented in the literature but overrepre- 
sented in our systematic data set (Fig. 3A). This 
enrichment was driven by a set of 34 query genes, 
each encoding a component of the mitochondrial 
translation machinery. All 34 queries were suppressed
by missense mutations in the α, β, or γ subunits of
the F1 domain of the mitochondrial
adenosine triphosphate (ATP) synthase, and the 
majority of the substituted residues were located 
at the interfaces between these subunits (Fig. 3B). 
Mutations in the same mitochondrial ATP syn- 
thase subunits also suppressed deletion alleles of 
mitochondrial DNA and RNA polymerase genes, 
as well as three relatively uncharacterized genes: 
IRC19, PET130, and YPR117W (table S2). All of these
query mutations led to loss of the mitochondrial 
genome (mtDNA), which results in decreased 
growth due to a defect in the import of proteins 
into the mitochondria (23) (Fig. 3, C and D, and 
fig. S5A). The ATP synthase suppressor muta- 
tions could restore both fitness and mitochon- 
drial protein import in the absence of mtDNA 
(Fig. 3, C and D, and fig. S5B). Note that an 
activity of the ATP synthase other than ATP
synthesis was required for this suppression phe- 
notype (fig. S5, C and D). Although the mechanism 
by which the suppressor mutations increase 
protein import is unclear, one possibility is that 
the mutations reverse ATP synthase activity to 
generate ADP³⁻ instead of ATP⁴⁻. The charge
difference between these two nucleotide phosphates
could be exploited by adenine nucleotide
translocators to rebuild the mitochondrial membrane
potential, which is lost in the absence of
mtDNA and is thought to be required for protein
import into the mitochondria (Fig. 3D) (24).


Suppressor identification can predict 
novel gene function 


The functional relationship observed between a 
query mutant and its suppressor can be exploited 
to assign gene function to previously uncharacterized
genes. For example, in our systematically mapped
suppressor network, we found that loss-of-function
mutations in an uncharacterized gene, YMR010W,
suppressed the growth defect of mon2Δ mutants
(Fig. 4, A and B). Ymr010w belongs to the family of
PQ-loop proteins, some of which function as membrane
transporters (25), and localizes to both the Golgi and
late endosomes (fig. S6A). Mon2 is distantly related
to the Sec7 family of guanine nucleotide exchange
factors and physically interacts with Dop1, a

Fig. 3. The mitochondrial F1 ATPase (adenosine triphosphatase) is a suppressor hub in the systematic suppression network.
(A) The distribution of query and suppressor mutants in both the literature-curated and the systematic experimental
network across different biological processes. Node size reflects fold enrichment or depletion for query and suppressor
mutants observed for a given biological process. Significant enrichment or depletion was determined by Fisher's exact
test, comparing the observed to the expected proportion of genes in each functional category. Bonferroni-corrected
P values are indicated. (B) Bottom view, facing the inner membrane from the mitochondrial matrix, of the yeast
mitochondrial F1 ATPase structure 2HLD. Residues that were found to suppress the growth defect of mitochondrial
transcription or translation mutants are highlighted in red. Orange spheres represent the nucleotides bound to the
catalytic sites. (C) Fraction of wild-type and ATP synthase-mutant cells either with intact (ρ⁺) or (partially) deleted
(ρ⁻) mtDNA that show mitochondrial localization of GFP fused to a mitochondrial-targeting signal (MTS-GFP).
Averages (n = 4) and SD are shown. (D) Model of ATP synthase-dependent suppression of mitochondrial mutants (top)
and corresponding representative images of MTS-GFP import (bottom). Localization of outer mitochondrial membrane
protein mCherry-Fis1 shows the presence and position of mitochondria. ETC, electron transport chain; ΔΨ, inner
mitochondrial membrane potential; ANT, adenine nucleotide translocator. Scale bar, 5 μm.


Fig. 4. Characterization of YMR010W (ANY1). (A) Predicted membrane topology of Ymr010w. Sites of suppressor mutations,
ubiquitination, and phosphorylation are indicated. Single-letter abbreviations for the amino acid residues are as
follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln;
R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. (B) Suppression of the growth defect caused by a mon2Δ deletion
allele, or TS alleles dop1-1 and neo1-2, by deletion of YMR010W. Series of 10-fold dilutions of exponentially growing
cultures of the indicated strains were spotted on plates with YPD medium and incubated at either 22°C or 38°C for
2 days. (C) Deletion of YMR010W restores membrane asymmetry in neo1-2 cells. Wild-type, ymr010wΔ, neo1-2, and
neo1-2 ymr010wΔ cells were grown at 34°C in the presence of the phosphatidylserine (PS) targeting peptide papuamide A,
or the phosphatidylethanolamine (PE) targeting peptide duramycin. Growth relative to vehicle-treated wild-type strain
is plotted. SEM is indicated by shading (n = 2 to 3). (D) Intracellular distribution of PS, visualized using GFP-LactC2
(31). Shown are representative confocal fluorescent micrographs of exponentially growing cells of the indicated
strains. The fraction of cells was calculated for each of the following groups: those (i) that showed diffuse cytosolic
fluorescence or (ii) localization of GFP-LactC2 to the plasma membrane, or (iii) in which GFP-LactC2 was partially
localized to distinct internal structures. Measurements were performed in triplicate on at least 100 cells, and
averages are shown. (E) Model of suppression of flippase mutants by loss of Ymr010w.

conserved membrane protein involved in endosome
to Golgi transport, as well as Neo1, an
essential member of the phospholipid flippase
family (26, 27). When tested directly, we found
that a ymr010wΔ deletion allele also suppressed
the growth defects of neo1-2 and dop1-1 TS mutants
(Fig. 4B). Moreover, a ymr010wΔ deletion
allele suppressed the lethality associated with
deletion alleles of the essential genes NEO1 and
DOP1 (fig. S6B). Loss of YMR010W function can
thus bypass the requirement for the Mon2/Dop1/Neo1
module.

The essential function of the Mon2/Dop1/Neo1
module is likely performed by Neo1, which is
thought to flip phosphatidylserine (PS) and
phosphatidylethanolamine (PE) from the exoplasmic
to the cytoplasmic leaflet of membrane bilayers
and thereby to establish an asymmetric distribution
of these lipids (28). A neo1-2 TS mutant
is defective in establishing membrane asymmetry.
This leads to hypersensitivity to papuamide A and
duramycin, bioactive peptides that disrupt membranes
through the binding of exposed PS and PE,
respectively (28-30), and reduced plasma membrane
localization of green fluorescent protein
(GFP)-LactC2, a probe for visualizing the distribution
of PS over cytoplasmic membrane
leaflets (31) (Fig. 4, C and D). Overexpression of
YMR010W also led to reduced levels of PS at the
cytoplasmic leaflet of the plasma membrane, and
accumulation of GFP-LactC2 in internal structures
(Fig. 4D), thus mimicking the phenotype of
a neo1-2 mutant. We found that a ymr010wΔ
deletion allele suppressed both the sensitivity of
a neo1-2 TS mutant to papuamide A and duramycin
(Fig. 4C), and the neo1-2 GFP-LactC2 localization
defect (Fig. 4D). The absence of these phenotypes
suggests that the neo1-2 phospholipid distribution
defects are corrected in the double mutant.

In addition to suppressing loss of Neo1 function,
a ymr010wΔ deletion allele suppressed the
cold sensitivity caused by loss of the flippase
Drs2 (fig. S6C). Moreover, neo1Δ lethality was no
longer suppressed by ymr010wΔ in the absence
of Drs2 (fig. S6C). An intriguing possibility is
that Ymr010w functions as a scramblase that
transports PS and PE bidirectionally to at least
partially collapse the membrane asymmetry established
by Neo1 and other flippases (Fig. 4).
Deletion of YMR010W would then allow Drs2,
possibly with the help of other flippases, to more
easily establish membrane asymmetry in the absence
of Neo1. We named the YMR010W open
reading frame ANY1 for antagonizes Neo1 yeast
phospholipid flippase.


Frequent secondary mutations delay the 
onset of stationary phase 


Whole-genome sequencing revealed that, besides 
the suppressor mutation, each suppressor strain 
carried on average eight additional secondary 
mutations (table S5). Unlike the suppressor mu- 
tations, none of these secondary mutations af- 
fected exponential cell growth enough to be 
detected by SGA mapping analysis (table S3), 
suggesting the majority are random mutations 
that arose during DNA replication. We therefore
refer to these additional secondary mutations
as “passenger” mutations. We identified a
similar number of passenger mutations in a con- 
trol set of 72 strains that did not carry a sup- 
pressor mutation that affects growth of the query 
mutant (table S5). Of the 304 strains that were 
sequenced at >10-fold coverage, only one
query strain, deleted for PMS1, which encodes a
mismatch repair protein, displayed a mutator 
phenotype, exhibiting a relatively large number 
(76) of passenger mutations. In total, we identi- 
fied 2024 unique passenger mutations, of which 
996 were in coding regions, affecting 744 protein- 
or RNA-encoding genes. The fraction of missense, 
nonsense, and frameshift mutations was substan- 
tially smaller among the passenger mutations 
than among the suppressor mutations (Fig. 5A). 
In fact, most of the passenger mutations (64%) 
resulted in synonymous changes or mapped to 
intergenic regions (Fig. 5A). Furthermore, pas- 
senger missense mutations occurred less fre- 
quently in essential genes, were predicted to be 
less deleterious, were less often at protein-protein 
interaction interfaces, and occurred more often in 
disordered protein regions than suppressor mis- 
sense mutations (Fig. 5B). Thus, the majority of 
the passenger mutations, which have no effect on 
exponential growth of the query strain, have a 
lower putative functional impact than the sup- 
pressor mutations that do affect query strain cell 
growth. 

A previous study suggested that deletion of 
a particular query gene may select for further 
genetic changes, such as the occurrence of spe- 
cific secondary nonsuppressor mutations (32). 
However, we did not observe a correlation be- 
tween the number of passenger mutations and 
the fitness of the query strain (fig. S7A). More- 
over, genes carrying passenger mutations do not 
tend to be coannotated or coexpressed with the 
corresponding query or suppressor gene (fig. S7B). 
In addition, we did not find any enrichment for 
particular GO terms among query genes that 
shared the same passenger mutation, or for 
shared passenger mutant genes among mul- 
tiple, independent isolates of a particular query 
mutant strain. However, we found that 10 strains 
that all carried a suppressor mutation in ATP2 
but had different query mutations involved in 
mitochondrial transcription or translation, also 
harbored a third mutation in HEM1, TPN1, or
HAP1. These three genes are important for heme
biosynthesis, and these mutations may thus be 
selected for to maintain heme homeostasis in the 
absence of mitochondrial transcription, transla- 
tion, or ATP synthase activity. Still, in most cases, 
different isolates of the same query suppressor 
strains did not contain mutations in the same 
passenger genes, and most passenger genes were 
not functionally related to either the query or the 
suppressor gene, indicating that passenger muta- 
tions are not generally dependent on preexisting 
mutations. 

We did find several genes that were mutated 
in a large fraction of the sequenced strains; this 
suggested that they may be adaptive and may not 
represent innocuous passenger mutations (Fig. 5C). 
Of all sequenced strains, including wild-type controls,
29% carried unique mutations in WHI2,
IRA1, IRA2, RIM15, CUP9, and/or UBC13. Multiple
experimental evolution studies have identified a 
similar set of frequently mutated yeast genes 
(33-35). Most of the mutations were frameshift 
or nonsense mutations, suggesting a selection for 
loss-of-function of these genes (Fig. 5C). Exponential
growth rates of whi2Δ, ira2Δ, rim15Δ,
and ubc13Δ deletion mutants were not enhanced
relative to a his3Δ deletion mutant control. It thus
appears that there was no selection for these fre- 
quent secondary mutations on the basis of an in- 
creased maximum growth rate (fig. S7C). Note 
that Whi2, Iral, Ira2, and Rim15 are all negative 
regulators of the RAS/cyclic adenosine mono- 
phosphate (cAMP)/protein kinase A (PKA) path- 
way, which, in response to glucose, stimulates 
population expansion (36-39). When glucose 
levels become limited, the RAS/cAMP/PKA path- 
way is repressed, thereby causing cells to stop 
dividing and enter stationary phase. Disruptive 
mutations in WHI2, IRA1, IRA2, or RIM15 may
cause a delayed response to low glucose levels 
that enables a few additional rounds of cell 
division before cells enter stationary phase and, 
thereby, lead to increased representation of these 
mutants after serial passaging under laboratory 
conditions. We constructed mixed populations 
consisting of a strain deleted for one of the fre- 
quently mutated genes and a wild-type strain 
and followed their ratio for six rounds of serial 
passaging under conditions with a relatively 
prolonged stationary phase (27). Indeed, the relative
abundance of strains deleted for WHI2,
IRA2, RIM15, or UBC13 increased with each round
of serial passaging, whereas five control mutant
strains maintained abundances similar to or lower
than the wild-type reference strain (Fig. 5D and
fig. S7D). Similar results were obtained for IRA1
and IRA2 in another strain background, W303
(fig. S7, E and F). Thus, our data suggest that the 
vast majority of passenger mutations are random 
and not dependent on the query or suppressor 
mutation and, further, that a few additional sec- 
ondary mutations arise at high frequency be- 
cause of a selection for mutants that delay the 
onset of the stationary phase. 
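
The proposed selection mechanism can be made concrete with a toy calculation. The sketch below assumes, purely for illustration, that both strains are diluted equally at each passage and that the mutant completes a fixed number of extra cell divisions per round before entering stationary phase. The function name and parameter values are hypothetical and are not estimates from the data.

```python
def mutant_to_wildtype_ratio(initial_ratio, extra_divisions, rounds):
    """Toy model of competitive serial passaging.

    Each round, the mutant is assumed to undergo `extra_divisions` more
    doublings than the wild type, so the mutant:wild-type ratio is
    multiplied by 2 ** extra_divisions. Returns the ratio after each
    round, with round 0 (the starting mixture) included.
    """
    ratios = [initial_ratio]
    for _ in range(rounds):
        ratios.append(ratios[-1] * 2 ** extra_divisions)
    return ratios


# Hypothetical example: equal starting mixture, one extra division per
# round, six rounds of passaging.
for round_number, ratio in enumerate(mutant_to_wildtype_ratio(1.0, 1, 6)):
    print(round_number, ratio)
```

Under these assumptions even a single extra division per round compounds geometrically, which is consistent with a steadily rising mutant fraction without any change in maximum exponential growth rate.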


Mechanistic categories of 
suppression interactions 


We classified the suppression interactions into 
distinct mechanistic categories on the basis of 
the functional relationship between query and 
suppressor. Most queries (54%) reported in the 
literature or identified by our systematic analysis 
are suppressed by mutations in functionally rela- 
ted genes (class “A”) (Fig. 6, A and B). These func- 
tional connections can be further divided into 
four subclasses. Subclass “Al” includes 135 in- 
teractions from the literature and systematic 
networks, in which both the query and the sup- 
pressor genes encode members of the same pro- 
tein complex. These particular interactions can 
reflect a mechanism whereby the suppressor 
represents a gain-of-function mutation (fig. S2A). 
Subclass “A2,” to which 201 interactions from our 

Fig. 5. Characterization of potential passenger mutations. (A) Distribution of suppressor and potential passenger
mutations over variant effect classes. Only SNPs are considered, as reliable structural variant calls (deletions,
insertions, or inversions involving >5 base pairs) were only available for suppressor mutations. The RNA class refers
to mutations in an RNA species such as a noncoding, ribosomal, or transfer RNA. (B) The fraction of all suppressor
or potential passenger missense mutations that map to an essential gene, at a protein-protein interaction (PPI)
interface, or at a disordered region of a protein, and the predicted deleteriousness of these mutations (SIFT scores:
0 = extremely deleterious and 1 = benign). P values were calculated using Fisher’s exact test, except for the SIFT
analysis, in which a Mann-Whitney test was used. (C) The percentage of strains in which a particular gene carries a
passenger mutation is plotted against the chromosomal position of the gene (x axis, chromosomal coordinate). Genes
that are recurrently mutated in >2% of the sequenced strains are highlighted, and the distribution of the mutations
over variant effect classes is shown. (D) Differentially fluorescently labeled cells of the indicated mutants [labeled
with red fluorescent protein (RFP)] and wild type (GFP) were mixed, and the ratio of RFP to GFP was followed for six
rounds of serial passaging on agar plates (x axis, rounds of serial passaging). Shading represents the SD, n = 12.


network were assigned, describes cases where the 
query mutant growth defect is suppressed by a 
mutation in a gene that is annotated to the same 
pathway. In the case of loss-of-function suppres- 
sor mutations, the suppressor gene often has 
antagonistic effects compared with the query 
gene (e.g., Fig. 4). Subclass “A3” involves sup- 
pression by a different, but related, pathway 
and explains 195 interactions in our networks. 
In this scenario, the growth phenotype caused 
by absence of a specific cellular function re- 
quired for normal cell growth is suppressed when 
an alternative pathway is rewired to re-create the 
missing activity (e.g., Fig. 3). Finally, subclass “A4” 
consists of gene pairs that are annotated to the 
same biological process but for which pathway or 
complex annotation data were not available for 
both genes. 

In addition to suppression interactions between 
functionally related genes, suppression interactions 
involving hypomorphic (partial loss-of-function) 
alleles, such as conditional TS alleles of essential
genes, revealed a different and more general
class of suppressors that affect expression of 
the query gene. This type of suppression (Figs. 2A 
and 3A) can be achieved by stabilizing a mutant 
mRNA or protein through the perturbation of 
pathways or complexes that regulate mRNA or 
protein turnover (Fig. 6A, class “B” and “C”). Al- 
though this type of suppression is rarely described 
in the literature, 48% of the hypomorphic queries 
in our experimental data set are suppressed by 
mutations in protein degradation or mRNA de- 
cay genes (Fig. 6B); this indicates that this type of 
allele-specific suppression is one of the main 
routes through which partial loss-of-function 
alleles can be suppressed. Of the suppression 
interactions, 60 to 70% fall into one of these 
mechanistic classes, as compared with only 34% 
of positive genetic interactions identified by SGA 
(6) and 11% of passenger-query pairs. Thus, positive
genetic interactions that are true suppression
interactions often show high functional and
mechanistic specificity.
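
The mechanistic classes described in this section can be summarized as a simple decision procedure. The sketch below is one possible reading, not the authors' pipeline: the `Gene` record, the order of precedence among classes, the `related_pathways` lookup, and the annotation strings are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    """Hypothetical annotation record for a gene."""
    name: str
    complexes: frozenset = frozenset()
    pathways: frozenset = frozenset()
    processes: frozenset = frozenset()


def classify(query, suppressor, query_is_hypomorph=False,
             related_pathways=frozenset()):
    """Assign a query-suppressor pair to a mechanistic class.

    `related_pathways` is a set of frozenset({pathway_1, pathway_2})
    pairs treated as different but related pathways.
    """
    if query.complexes & suppressor.complexes:
        return "A1"  # members of the same protein complex
    if query.pathways & suppressor.pathways:
        return "A2"  # annotated to the same pathway
    if any(frozenset((p, q)) in related_pathways
           for p in query.pathways for q in suppressor.pathways):
        return "A3"  # different but related pathway
    if query.processes & suppressor.processes:
        return "A4"  # same biological process only
    if query_is_hypomorph and "protein degradation" in suppressor.processes:
        return "B"   # suppressor stabilizes the mutant protein
    if query_is_hypomorph and "mRNA decay" in suppressor.processes:
        return "C"   # suppressor stabilizes the mutant mRNA
    return "unclassified"


# Hypothetical genes, for illustration only.
query = Gene("QUERY1", processes=frozenset({"translation"}))
suppressor = Gene("SUPP1", processes=frozenset({"protein degradation"}))
print(classify(query, suppressor, query_is_hypomorph=True))
```

Checking the functional classes before classes B and C reflects the distinction drawn in the text between functionally related suppressors and the more general suppressors of hypomorphic alleles; a different ordering would change how ambiguous pairs are counted.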


Discussion 


A global, literature-curated network of genetic 
suppression interactions (Fig. 1) showed that 
the majority of suppression interactions linked 
functionally related genes. Moreover, suppression 
interactions overlapped significantly with other 
types of genetic interactions (Fig. 2). Systematic 
suppression analysis confirmed these general prop- 
erties of suppression and further showed that 
suppression of hypomorphic alleles often occurs 
via loss of protein or mRNA degradation, a find- 
ing that was less obvious in literature-curated 
data (Fig. 6). The underrepresentation of this 
class of interactions in the literature is consistent 
with what has been reported for dosage suppres- 
sion interactions (73) and may reflect that mech- 
anistic studies focused on the functional analysis 
of a particular gene or pathway are less likely to 
report nonspecific suppressors. Nevertheless, an
understanding of the prevalence of this form of 
suppression could be important when interpret- 
ing a genotype-to-phenotype relationship. Even 
though the genes encoding proteasome or mRNA- 
decay components are essential in human cell 
lines (40-42), we anticipate that genetic varia- 
tion that subtly modulates the activity of these 
modules may exhibit genetic interactions asso- 
ciated with a decreased disease risk for a variety 
of human disorders. As in yeast, these processes 
may thus buffer a range of detrimental mutations 
in humans and, thereby, modify numerous dif- 
ferent disease phenotypes. 

Despite the prevalence of these general sup- 
pression mechanisms, most suppression gene pairs 
showed a close functional relationship (Fig. 6), so 
that genetic suppression can be used to assign 
function to a previously uncharacterized gene 
(Fig. 4). The suppressor interactions identified in 
our systematic screen resulted from the direct 
selection of spontaneous mutations during stan- 
dard laboratory growth of a query mutant whose 
fitness was compromised. In total, ~3% of strains 
in the yeast nonessential deletion mutant collec- 
tion and ~4% of the strains in the hypomorphic 
essential gene mutant collections showed evi- 
dence of a suppressor locus when screened by 
SGA. Whole-genome sequencing of 251 potential 
suppressor strains did not reveal any instances of 
suppression via aneuploidy, a mutational event 
involving copy number variation of many genes, 
possibly because aneuploidies are not necessarily 
revealed by SGA genetic mapping or because 
these events come at a fitness cost (43). Although 
SGA suppressor mapping can theoretically iden- 
tify multiple suppressor mutations within one 
strain (20), no query strains with multiple sup- 
pressor linkages were identified. This suggests 
that the direct selection for spontaneous sup- 
pressors does not mimic adaptive evolution of 
wild-type strains in nutrient-limited conditions, 
in which aneuploidies and mutations in multiple 
genes, each contributing small fitness increases, 
combine to collectively produce a robust sup- 
pression phenotype (35, 44). In contrast, we found 
that there is often a single direct suppression 
strategy because for most (~67%) of the queries 
for which we isolated several independent sup- 
pressor mutations, these recurrently occurred 
within the same single suppressor gene or within 
genes that encode subunits of the same com- 
plex. In addition, we found that large increases 
in fitness are mainly achieved by mutations in 
genes that have a close functional relation to 
the query gene (fig. S4). Thus, only a few, very 
specific mutational events appear to be able to 
substantially increase the fitness of a particular 
query mutant. 

Besides the suppressor mutation, each strain 
also carried, on average, eight additional passen- 
ger mutations that did not have a measurable 
effect on exponential growth rate. In a previous, 
but relatively limited, study, it was suggested that 
the deletion of a query gene in the deletion mu- 
tant collection often selects for further genetic 
changes (32). Although this is true for suppressor 

Fig. 6. Mechanistic classes of suppression. (A) Suppressor and query genes often have a functional
relationship (class “A”). In a situation where the query (protein A) activates a protein B, which is required for
normal growth, suppression can take place in multiple ways. For example, the suppressor (protein C) can
be part of the same complex as the query, and gain-of-function mutations in C can restore the activation of
B (class “A1”). Alternatively, the suppressor and query may be members of the same pathway, and the
suppressor (protein D) may inactivate or inhibit B. Loss of D may thus suppress by partially restoring the
activity of B (class “A2”). The suppressor (protein E) can also function in an alternative, but related,
pathway, whose activity can be slightly altered to restore the activity of B (class “A3”). Suppression interactions
can also occur among pairs of genes that do not share a close functional relationship. For example, partial
loss-of-function query alleles may carry mutations that destabilize the protein or mRNA, leading to a
fitness defect caused by reduced levels of the query protein. This can be suppressed by a loss-of-function
mutation in a member of the protein degradation (class “B”) or mRNA decay (class “C”) pathway, which
may partially restore the levels of the query protein. NMD, nonsense-mediated mRNA decay.
(B) Distribution of suppression interactions, positive genetic interactions (6), and passenger-query pairs across
different mechanistic suppression classes.

mutations, we could not find any substantial
evidence connecting the query or suppressor 
mutation to the occurrence of most passenger 
mutations. Because we did not observe a sig- 
nificant enrichment for functionally related gene 
pairs among queries and passengers (fig. S7B), we 
conclude that the occurrence of query-driven non- 
suppressor mutations is likely rare. 

In a mathematical model of bacterial serial 
passaging, de novo mutations that delay the 
onset of stationary phase were more likely to 
fix in a population than mutations that decrease 
lag time or increase growth or survival rates (45). 
This may be true for yeast as well, as the growth 
history of laboratory-grown yeast strains follows 
a similar pattern of repeating cycles of lag phase, 
exponential growth, and stationary phase. In- 
deed, we observe selection for mutations that 
likely delay the onset of stationary phase in 
26% of the sequenced strains (Fig. 5C). These 
stationary-delay mutations are thus not true 
“passenger” mutations but are adaptive. How- 
ever, in contrast to suppressor mutations that 
cause adaptation to the query mutation, the 
stationary-delay mutations are adaptive to lab- 
oratory passaging. These mutations could come 
at a cost, as they probably decrease viability dur- 
ing longer periods of starvation (35, 36). 

As most (78%) suppression interactions did not 
overlap with any previously identified genetic 
interactions, additional suppression mapping 
will help complete the yeast genetic interaction 
landscape. Conditional alleles have been devel- 
oped for nearly all essential yeast genes (6), and 
thus, suppression interactions could be mapped 
for the full set of essential genes. Similarly, sup- 
pressors of nonessential genes could be identi- 
fied in a conditional or synthetic lethal context in 
which the nonessential query has a fitness defect. 
Although we focused on mapping suppression 
interactions in yeast, similar suppression studies 
should be possible in mammalian cells and model 
systems and may identify new drug targets for 
query mutations related to human disease (46). 
As ~6% of human pathogenic variants are fixed 
in other mammalian species (47), compensa- 
tory mutations may be present at a high fre- 
quency in natural populations. Understanding 
genetic suppression may provide insight into how
genetic variance accumulates during evolution 
and more specifically how modifier genes de- 
termine the severity of genetic traits, including 
human disease. 


Materials and methods 


Detailed materials and methods are available in 
the supplementary information. 


Literature curation 


The Saccharomyces cerevisiae “synthetic rescue” 
data set was downloaded from the BioGRID (9) 
on 9 November 2012 (version 3.1.49) and on 31 
March 2014 (version 3.2.110). In total, these data 
sets consisted of 5985 interactions described in 
1667 papers. Each paper was read in detail, and 
an interaction was considered a suppression interaction
if the double mutant grew substantially
better than at least one of the single mutants.
For each interaction, suppressor and query al- 
lele type and specific conditions were annotated 
(21). The final data set consisted of 1842 unique 
interactions, involving 1304 genes (table S1). 
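
The curation criterion above can be written as a simple predicate. The curators judged growth from the published papers rather than from a numeric cutoff, so the fitness scale and the `min_gain` threshold used here for "substantially better" are hypothetical.

```python
def is_suppression(double_mutant, single_a, single_b, min_gain=0.1):
    """Return True if the double mutant grows substantially better than
    at least one of the two single mutants.

    Fitness values are on an arbitrary relative scale; `min_gain` is an
    assumed margin standing in for "substantially".
    """
    # Beating the sicker single mutant is enough to satisfy the criterion.
    return double_mutant - min(single_a, single_b) >= min_gain


# Hypothetical relative fitness values, for illustration only.
print(is_suppression(double_mutant=0.8, single_a=0.3, single_b=0.9))
```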


Systematic suppressor identification 


All suppressor strains were part of either the 
BY4741 nonessential deletion mutant collection 
(MATa xxxΔ::kanMX4 his3Δ1 leu2Δ0 ura3Δ0
met15Δ0; Euroscarf), the SGA nonessential deletion
mutant collection [MATα xxxΔ::natMX4
can1Δ::STE2pr-Sp_his5 lyp1Δ his3Δ1 leu2Δ0 ura3Δ0
met15Δ0; (2)] or the corresponding MATa and
MATα collections of DAmP or TS mutants of
essential genes (6). The presence and genomic 
location of a spontaneous suppressor mutation 
were identified by the occurrence of a suppressor 
linkage group upon screening strains in these 
collections by SGA analysis (20) (table S3). Potential
suppressor strains were subsequently subjected to
whole-genome sequencing on the Illumina HiSeq
2500 platform using paired-end 100-bp reads.
Read mapping, single-nucleotide polymorphism (SNP)
calling, and indel calling were performed
by using standard methods (27). Candidate
suppressor mutations were confirmed by ampli- 
fying the corresponding gene and flanking se- 
quences by polymerase chain reaction, followed 
by Sanger sequencing (table S2). Suppression 
interactions were confirmed using plasmid-based 
complementation assays and tetrad analysis of 
meiotic progeny derived from crossing each sup- 
pressor strain to either a wild-type strain, a strain 
with a marked deletion that was genetically 
linked to the candidate suppressor, or a strain 
carrying a deletion or hypomorphic allele of the 
suppressor gene (table S2). 

Microresonator soliton
dual-comb spectroscopy


Myoung-Gyun Suh,* Qi-Fan Yang,* Ki Youl Yang, Xu Yi, Kerry J. Vahala†


Measurement of optical and vibrational spectra with high resolution provides a way to 
identify chemical species in cluttered environments and is of general importance in many 
fields. Dual-comb spectroscopy has emerged as a powerful approach for acquiring nearly 
instantaneous Raman and optical spectra with unprecedented resolution. Spectra are 
generated directly in the electrical domain, without the need for bulky mechanical 
spectrometers. We demonstrate a miniature soliton-based dual-comb system that can 
potentially transfer the approach to a chip platform. These devices achieve high-coherence 
pulsed mode locking. They also feature broad, reproducible spectral envelopes, an 
essential feature for dual-comb spectroscopy. Our work shows the potential for integrated 
spectroscopy with high signal-to-noise ratios and fast acquisition rates. 


Since their demonstration in the late 1990s
(1-3), optical frequency combs have revolutionized
precision measurements of time and frequency and
have enabled new technologies such as optical
clocks (3). One remarkable method they make
possible is dual-comb
spectroscopy, which leverages the coherence prop- 
erties of combs for rapid broadband spectral anal- 
ysis with high accuracy (4-10). Frequency comb 
systems exist across a broad spectral range span- 
ning ultraviolet to infrared, making this method 
well suited for measurement of diverse molecular 
species (10). 

In parallel with advancements in frequency 
comb applications, the past decade has witnessed 
the appearance of miniature optical frequency 
combs or microcombs (11, 12). These microcombs 
have been demonstrated using several dielec- 
tric materials across a range of emission bands 
(11, 13-17). Under continuous-wave laser pump- 
ing, the combs are initiated by way of parametric 
oscillation (18, 19) and are broadened by cascaded 
four-wave mixing (11, 12) to spectral widths that 
can encompass an octave of spectrum (15). Four- 
wave mixing in the ultrafast intraband gain medium 
of quantum cascade lasers has also been shown 
to create frequency modulation (FM) combs (20). 
These FM systems have been applied to demon- 
strate dual-comb spectroscopy in the mid-infrared 
(21). Also, heterodyne of two conventional micro- 
combs in the mid-infrared has been demonstrated, 
a key step toward dual-comb spectroscopy (22). 

A major advancement in microcombs has been 
the realization of soliton mode locking (23-27). 
Soliton microcombs feature dissipative Kerr sol- 
itons that leverage the Kerr nonlinearity to both 
compensate dispersion and overcome cavity loss by 
way of parametric gain (23). Unlike earlier microcombs,
this new device provides phase-locked
femtosecond pulses with well-defined, repeatable 
spectral envelopes, which is important for dual- 
comb spectroscopy. The pulse repetition rate of 
the device is detectable, and it has excellent phase 
noise characteristics (24). In this work, we demon- 
strate dual-comb spectroscopy using this new plat- 
form. The dual-comb source spans >30 nm and 
the interferogram spectra feature high signal-to- 
noise ratios (SNRs). Also, precise microfabrication 
control enables close matching of the repetition 
rates so that >4 THz of optical bandwidth is mea- 
sured within 500 MHz of electrical bandwidth. 
A schematic view of the dual-comb experimental
setup (Fig. 1) shows two soliton trains having
different repetition rates ($\Delta f_r = f_{r1} - f_{r2}$) generated
from distinct microresonators and then combined 


Fig. 1. Microresonator- 
based dual-comb spec- 
troscopy. Two soliton 
pulse trains with slightly 
different repetition rates are 
generated by continuous- 
wave (CW) laser pumping 
of two microresonators. 
The pulse trains are com- 
bined in a fiber bidirectional 
coupler to produce a signal 
output path that passes 
through a test sample as 
well as a reference output 
path. The output of each 


[Fig. 1 diagram labels: CW Laser I; CW Laser II; Microresonator I; Microresonator II; Absorption spectrum; Optical Domain]


using a directional coupler. One of the combined 
streams is coupled through a gas cell of mole- 
cules (test sample in the figure) whose absorption 
spectrum is to be measured. The other combined 
stream provides a reference. The slight difference 
in repetition rates of the soliton streams creates 
a periodically time-varying interferogram in the 
detected current with a period $1/\Delta f_r$. Fourier
transform of this time-varying signal reveals the 
interfering soliton comb spectra, now shifted to 
radio-frequency rates. The signal spectrum con- 
taining the molecular absorption information 
is then normalized using the reference spectrum 
to reveal the spectral absorption of the gas cell. 
[See (28) for details of the experimental setup, 
methods, and equipment.] 
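
The normalization step described above can be illustrated with a toy calculation. This is a minimal sketch, not the authors' processing code: it assumes idealized, noise-free interferograms sampled over exactly one period of the repetition rate difference, and the function names and the example line amplitudes are hypothetical.

```python
import cmath
import math


def interferogram(amplitudes, samples):
    """One period of a toy interferogram; RF line n beats at n * delta_f_r."""
    return [
        sum(a * math.cos(2 * math.pi * n * m / samples)
            for n, a in enumerate(amplitudes, start=1))
        for m in range(samples)
    ]


def line_amplitudes(trace, n_lines):
    """Amplitude of RF lines 1..n_lines via a single-bin discrete Fourier sum."""
    samples = len(trace)
    result = []
    for n in range(1, n_lines + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * n * m / samples)
                  for m, x in enumerate(trace))
        result.append(2 * abs(acc) / samples)
    return result


def normalized_transmission(signal_trace, reference_trace, n_lines):
    """Divide the signal spectrum by the reference spectrum, line by line."""
    sig = line_amplitudes(signal_trace, n_lines)
    ref = line_amplitudes(reference_trace, n_lines)
    return [s / r for s, r in zip(sig, ref)]


# Hypothetical envelope on five RF lines; the toy sample attenuates line 3.
envelope = [0.5, 0.8, 1.0, 0.8, 0.5]
sample_response = [1.0, 1.0, 0.6, 1.0, 1.0]
reference = interferogram(envelope, samples=64)
signal = interferogram([e * t for e, t in zip(envelope, sample_response)], samples=64)
recovered = normalized_transmission(signal, reference, n_lines=5)
```

In this idealized case the envelope cancels in the ratio, so `recovered` reproduces `sample_response`. A real measurement would also have to handle noise, windowing, and drift between the two combs.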

Solitons are generated and stabilized in two 
microresonators by means of the active-capture/ 
locking technique (fig. S1) (24, 28). The locking 
makes it possible for the two combs to remain 
stable indefinitely. Typical soliton optical spectra 
(Fig. 2, A and B) feature the characteristic $\mathrm{sech}^2$
envelope, observed in this case over a 60-nm wave- 
length span. The detected electrical spectrum for 
each soliton source was measured using a spec- 
trum analyzer with a bandwidth of 26 GHz (Fig. 
2, C and D). The narrow spectral lines (resolu- 
tion bandwidth of 500 Hz) have SNRs greater 
than 75 dB, which shows that the corresponding 
repetition rates are extremely stable. The high-Q 
resonators used in this work are silica whispering- 
gallery devices fabricated on a silicon wafer by a 
combination of lithography and wet/dry etching 
(29). The unloaded quality factor of the micro- 
resonators is approximately 300 million, and the 
solitons have repetition rates of approximately 
22 GHz that were determined primarily by the 
diameter of the devices (3 mm). 

The optical outputs from the stabilized soliton 
sources are combined and coupled into two paths 
(Fig. 1 and fig. S1). One path contains a HCN gas 
cell (28), which functions as the test sample in the 
path is detected (on Reference and Signal photodetectors) to generate an electrical interferogram of the
two soliton pulse trains. The interferogram is Fourier-transformed to produce comb-like radio-frequency
(RF) electrical spectra having spectral lines spaced by the repetition rate difference of the soliton pulse
trains. The absorption features of the test sample can be extracted from this spectrum by normalizing
the signal spectrum by the reference spectrum. Also shown are two silica disk resonators. The
disks have a diameter of 3 mm and are fabricated on a silicon chip. The nth optical comb frequency
($\nu_{n1}$ and $\nu_{n2}$) for each soliton pulse train is given in terms of the respective repetition rate ($f_{r1}$ and $f_{r2}$)
plus an offset frequency ($f_{o1}$ and $f_{o2}$). In the RF domain spectrum, the nth line occurs at a multiple of
the difference in the repetition rates ($\Delta f_r = f_{r2} - f_{r1}$) plus an offset frequency ($\Delta f_o = f_{o2} - f_{o1}$).
Fig. 4. Measured molecular absorption spectra. (A) Absorption spectrum of the $2\nu_3$
band of H$^{13}$CN measured by direct power transmission using a wavelength-calibrated
scanning laser and comparison to the microresonator-based dual-comb spectrum. The
residual difference between the two spectra is shown in green. (B) Overlay of the
directly measured optical spectrum and the dual-comb spectrum, showing line-by-line
matching. The vertical positions of the two spectra are adjusted to compensate for
insertion loss. [Plot labels: Absorption; Scanning laser; Dual comb; Residual; Wavelength (nm)]

measurement. The other path is coupled directly 
to a photodetector and functions as the reference. 
The test sample path also includes an alternate 
path (shown in fig. S1) in which a WaveShaper is 
inserted (28). Temperature control of one of the 
microresonators is used to tune the relative 
optical frequency difference of the two soliton 
streams. In our measurements, this difference 
was held below 1 GHz, allowing the observation of 
the temporal interferogram on an oscilloscope 
(bandwidth 1 GHz). 

The reference interferogram (Fig. 3A) has a pe- 
riod of 386 ns, corresponding to a soliton repe- 
tition rate difference of 2.6 MHz. This relatively 
small repetition rate difference was made possible 
by precise control of the resonator diameter (and 
hence the resonator free spectral range) using 
calibrated wet etching of the silica (29). It was 
possible to fabricate disks with even more closely 
matched repetition rates (<100 kHz). Figure 3B 
shows the calculated Fourier transform of the in- 
terferogram. The small repetition rate difference 
on the much larger 22-GHz soliton repetition rate 
makes it possible to compress an optical span of 
4 THz (1535 to 1567 nm) into 500 MHz of electri- 
cal spectrum. The measured wavelength span is 
actually narrower than the observable wavelength 
span of the original soliton pulse streams and is 
limited by the photodetector noise. The interfer- 
ogram spectrum has a SNR around 30 dB near 
the central lines. A zoom-in of the spectrum (multi- 
and single-line) is shown in Fig. 3C. The electrical 
comb lines are equidistantly separated by 2.6 MHz 
and have a linewidth (full width at half maximum) 
of <50 kHz, limited by the mutual coherence of the 
independent fiber pump lasers. The pump laser
line in a dissipative Kerr soliton is also a comb 
tooth in the soliton optical spectrum. As a result, 
the frequency jitter in each pump is transferred 
as an overall shift on the resulting soliton comb. 
Externally locking the two combs should reduce the 
observed linewidth in the interferogram spectrum. 
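
The compression of the optical span into the electrical domain follows from simple arithmetic on the numbers quoted above. The sketch below uses the rounded values from the text (22 GHz, 2.6 MHz, 4 THz); the constant and function names are illustrative only.

```python
F_REP_HZ = 22e9          # approximate soliton repetition rate
DELTA_F_REP_HZ = 2.6e6   # repetition rate difference between the two combs
OPTICAL_SPAN_HZ = 4e12   # measured optical span


def interferogram_period_s(delta_f_rep_hz):
    """The interferogram repeats once per period of the rate difference."""
    return 1.0 / delta_f_rep_hz


def comb_line_count(optical_span_hz, f_rep_hz):
    """Approximate number of comb lines that fit into the optical span."""
    return optical_span_hz / f_rep_hz


def rf_span_hz(optical_span_hz, f_rep_hz, delta_f_rep_hz):
    """Each optical line spacing maps onto one rate difference in the RF domain."""
    return comb_line_count(optical_span_hz, f_rep_hz) * delta_f_rep_hz


period_ns = interferogram_period_s(DELTA_F_REP_HZ) * 1e9
lines = comb_line_count(OPTICAL_SPAN_HZ, F_REP_HZ)
rf_mhz = rf_span_hz(OPTICAL_SPAN_HZ, F_REP_HZ, DELTA_F_REP_HZ) / 1e6
compression = F_REP_HZ / DELTA_F_REP_HZ
```

With these rounded inputs the period comes out near 385 ns, the span holds roughly 180 comb lines, and the RF span is a little under 500 MHz, in line with the figures in the text. The small differences from the quoted 386 ns reflect rounding of the rate difference.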

The pump lines are placed toward the high- 
frequency side (near 550 MHz) of the spectral max- 
imum in the interferogram spectrum (see Fig. 3B). 
In the optical spectra (Fig. 2, A and B), the pump is 
blue-detuned relative to the soliton spectral max- 
imum [this occurs on account of the Raman self- 
frequency shift of the soliton (24)]. This spectral 
landmark shows that the relative spectral place- 
ment of the soliton combs is such that high optical 
frequencies are mapped to high interferogram fre- 
quencies. The way in which certain nonidealities 
in the soliton spectra map into the interferogram 
spectrum is also of interest. Specifically, there are 
avoided mode crossing-induced Fano-like spurs 
(24) in the soliton optical spectra (Fig. 2, A and B) 
occurring near 1535 nm, and this generates a cor- 
responding feature at 750 MHz in Fig. 3B. 

As an initial test of the dual-comb source, syn- 
thetic absorption spectra were programmed in a 
WaveShaper and then measured as dual-comb 
spectra (28). In Fig. 3, D and E, Fourier transforms 
of the signal interferograms produced using the 
synthetic absorber are shown. The two program- 
med functions are a spectrally flat 3-dB absorp- 
tion and a sine-wave absorption having a 4-dB 
amplitude. The synthetic absorption spectra, ob- 
tained by normalizing the signal and reference 
electrical spectra, are compared with the programmed
functions in Fig. 3, D and E. The ability to
reconstruct these synthetic spectral profiles clear- 
ly demonstrates the reproducibility of the soliton 
spectral profile. 

Finally, we studied the absorption spectrum of 
the HCN $2\nu_3$ band. In Fig. 4A, the measured dual-comb
absorption spectrum from 1538 to 1562 nm
is shown in red and compared with a directly mea- 
sured absorption spectrum shown in blue. Both 
absorption spectra are normalized. A laser locked 
to a molecular absorption line enabled absolute 
frequency calibration of the spectra (28). Sampling- 
induced choppiness of the dual-comb spectrum is 
caused by the relatively coarse spectral resolution 
of the solitons in comparison to the spectral scale 
of the HCN absorption lines. Nonetheless, the 
characteristic envelope of the HCN $2\nu_3$ band is
clearly resolved. The residual difference between 
the two absorption spectra is shown in green in 
Fig. 4A; the calculated standard deviation is 0.0254. 
Furthermore, a line-by-line overlay of the measured 
optical and dual-comb spectra (Fig. 4B) confirms 
the wavelength precision and absorption intensity 
accuracy of the dual-comb source. The directly 
measured HCN absorption spectrum is obtained 
by coupling an external cavity diode laser (ECDL) 
into the H$^{13}$CN gas cell and then scanning the
laser while monitoring the transmitted optical 
power. A separate signal is also tapped from the 
ECDL to function as a reference (28). 

In principle, a soliton source with finer comb 
spacing (i.e., lower repetition frequency) is possi- 
ble. Nonsoliton microcombs with mode spacings 
as narrow as 2.4 GHz have been demonstrated 
using the silica resonator platform (16). The use
of electro-optical modulators to modulate the mi- 
crocombs by a fraction of the repetition frequency 
is another possible way to create a finer spectral 
comb grid. Also, tuning of the combs is, in princi- 
ple, possible so as to create a nearly continuous 
high-resolution absorption measurement. 

The dual-comb source is centered near 1550 nm 
(in the C-band) for this work; however, operation 
at other wavelengths within the transmission 
window of silica is also possible. Moreover, using 
fiber nonlinear broadening or internal (resona- 
tor) dispersive wave generation (25), it should be 
possible to greatly extend the comb spectral span. 
Resonator dispersion engineering (30) can also be 
used to extend comb bandwidth. More generally, 
a wide range of materials are available for micro- 
combs enabling access to mid-infrared spectra. 
With further improvements, it should also be 
possible to realize chip-based dual-comb coher- 
ent anti-Stokes Raman spectroscopy (CARS). The 
potential for monolithic integration with other 
devices makes soliton-based microcombs well 
suited for possible realization of a dual-comb spec- 
troscopic system-on-a-chip. 
A coherent Ising machine for
2000-node optimization problems 


Takahiro Inagaki, Yoshitaka Haribara, Koji Igarashi, Tomohiro Sonobe,
Shuhei Tamate, Toshimori Honjo, Alireza Marandi, Peter L. McMahon,
Shoko Utsunomiya, Hiroki Takesue


The analysis and optimization of complex systems can be reduced to mathematical 
problems collectively known as combinatorial optimization. Many such problems can be 
mapped onto ground-state search problems of the Ising model, and various artificial spin 
systems are now emerging as promising approaches. However, physical Ising machines 
have suffered from limited numbers of spin-spin couplings because of implementations 
based on localized spins, resulting in severe scalability problems. We report a 2000-spin 
network with all-to-all spin-spin couplings. Using a measurement and feedback scheme, 
we coupled time-multiplexed degenerate optical parametric oscillators to implement 
maximum cut problems on arbitrary graph topologies with up to 2000 nodes. Our coherent 
Ising machine outperformed simulated annealing in terms of accuracy and computation
time for a 2000-node complete graph.


Combinatorial optimization is an important
task for various applications such as drug 
discovery, resource optimization in wire- 
less networks, and machine learning (1). 
Many such problems are known to be 
nondeterministic polynomial time (NP)-hard or 
NP-complete problems, which are considered difficult
to solve efficiently with modern digital computers.
Several heuristic or “greedy” algorithms
have been proposed in the field of computer science 
to find the optimal or near-optimal solutions to 
those problems. Among such algorithms, simu- 
lated annealing (SA) has performed well in solv- 
ing various types of combinatorial optimization 
problems (2). It is known that combinatorial 
optimization problems can be mapped onto 
ground-state search problems of the Ising model 
with polynomial resources (3). Several approaches 
have been proposed and demonstrated to find 
solutions to Ising problems using networks of 
artificial spins based on various physical sys- 
tems (4-7). 

A coherent Ising machine (CIM) is one such 
artificial Ising machine, in which a laser (8) or a 
degenerate optical parametric oscillator (DOPO) 
(9) represents an artificial spin. This proposal was 
motivated by early studies on the quantum behav- 
iors of a DOPO (10-14), and proof-of-principle 
CIM experiments have already been reported 
using time-multiplexed DOPOs (15, 16). To date, 
physical Ising machines (4-7) have suffered from
a limited number of spin-spin couplings due to
localized spin implementations. For example, in
a recent quantum annealing system based on 1152
superconducting qubits, the number of spin-spin 
couplings was 3360, constrained by the chimera 
graph structure (17). This limited number of spin- 
spin couplings has made it difficult to map various 
real-world optimization problems to the hard- 
ware. Here, we report a CIM that consists of 2048 
DOPOs with full spin-spin couplings. A measure- 
ment and feedback (MFB) scheme enabled us to 
implement all possible connections among 2048 
spins, which amounts to 2,096,128 spin-spin coupl- 
ings, making it possible to solve arbitrary graph 
problems. 
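
The coupling counts quoted above follow from the number of distinct spin pairs, N(N - 1)/2. The short sketch below reproduces them; the function name is illustrative.

```python
def all_to_all_couplings(n_spins):
    """Number of distinct spin pairs in a fully connected network."""
    return n_spins * (n_spins - 1) // 2


cim_couplings = all_to_all_couplings(2048)

# For comparison: the 1152-qubit annealer mentioned above has 3360 couplings.
annealer_pairs = all_to_all_couplings(1152)
annealer_fraction = 3360 / annealer_pairs
```

Here `cim_couplings` evaluates to 2,096,128, matching the text, while the 3360 couplings of the 1152-qubit system cover only about 0.5% of its possible pairs.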

An Ising Hamiltonian without an external mag- 
netic field is given by 


$$H(\sigma) = -\sum_{i<j} J_{ij}\,\sigma_i \sigma_j \qquad (1 \le i, j \le N) \tag{1}$$

(18), where $\sigma_i \in \{-1, +1\}$ denotes the ith Ising spin,
$J_{ij}$ is the interaction strength between the ith and
jth spins, and N is the total number of spins. The
Ising machines described above are designed to 
find the ground state or low-energy states of the 
Ising Hamiltonian. A DOPO takes only a 0 or $\pi$
phase relative to the pump phase above the threshold,
so it can be used as a stable artificial spin
realized with photonics technologies (19-22). A
CIM solves the Ising problem according to the
minimum-gain principle (8). According to this principle,
if the optical couplings between the DOPOs
are adjusted to implement $\{J_{ij}\}$, the DOPO network
oscillates in the phase configuration with the
lowest loss, which corresponds to the ground state 
of the Ising Hamiltonian in Eq. 1. 
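
To make the ground-state search concrete, the following minimal sketch evaluates an Ising energy of the form H = -sum over i<j of J_ij s_i s_j (no external field) and finds the ground state by exhaustive search. It is an illustration only: exhaustive search is feasible just for a handful of spins, and the three-spin coupling matrix is a hypothetical example.

```python
from itertools import product


def ising_energy(J, spins):
    """H = -sum_{i<j} J[i][j] * s_i * s_j for spins in {-1, +1}."""
    n = len(spins)
    return -sum(J[i][j] * spins[i] * spins[j]
                for i in range(n) for j in range(i + 1, n))


def brute_force_ground_state(J):
    """Check all 2^N configurations and return one with the lowest energy."""
    n = len(J)
    best = min(product((-1, 1), repeat=n), key=lambda s: ising_energy(J, s))
    return best, ising_energy(J, best)


# Hypothetical antiferromagnetic triangle: J_ij = -1 for every pair.
J_triangle = [[0, -1, -1],
              [-1, 0, -1],
              [-1, -1, 0]]
state, energy = brute_force_ground_state(J_triangle)
```

The triangle is frustrated: no configuration satisfies all three couplings, so the lowest energy is -1 rather than -3. The number of configurations doubles with every added spin, which is why heuristic or physical solvers are of interest for thousands of spins.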

The size of the problem that a CIM can handle 
is determined by the number of DOPOs and their 
couplings. Delayed interferometers were used to 
couple the DOPOs in the previous CIM experi- 
ments (15, 16); however, this scheme does not 
scale as the problems become more complex. To 
avoid this issue, Haribara et al. proposed an 
MFB scheme that makes it possible to couple any
combination of N DOPOs (23). 

Figure 1 shows a schematic of a CIM with MFB 
(24). The CIM uses a periodically poled lithium 
niobate (PPLN) waveguide, which is placed in a 
1-km fiber cavity, as a phase-sensitive amplifier
(PSA) that amplifies only the 0 or $\pi$ phase components
relative to the pump phase (25, 26). Other
components in the 1-km fiber cavity include a 9:1 
coupler (coupler 1) for extracting the DOPO pulses 
for balanced homodyne detection (BHD), a 1-km 
dispersion-shifted fiber, and another 9:1 coupler
(coupler 2) for injecting feedback signal pulses.
When we start pumping the PSA, quadrature- 
squeezed noise pulses are generated as a result 
of spontaneous parametric downconversion. The 
noise pulses undergo repeated phase-sensitive am- 
plification, which leads to the formation of DOPO 
pulses. Because the cavity round-trip time is 5 μs


Fig. 1. Setup of a coherent Ising machine (CIM) with measurement and feedback (MFB). IM,
intensity modulator; EDFA, erbium-doped fiber amplifier; SHG, second harmonic generation; FS,
piezo-based fiber stretcher; DSF, dispersion-shifted fiber. [Diagram labels: Measurement & feedback;
Local oscillator; FPGA module; Pump pulse preparation]
and the pump pulse interval is 1 ns, the cavity can
accommodate a maximum of 5082 DOPOs, of which 
2048 DOPOs are used as Ising spins (henceforth 
“signal DOPOs”). Another 2834 DOPOs, which we 
call “training DOPOs,” are always being oscillated 
and are used to obtain error signals for phase- 
locking the 1-km fiber cavity, the local oscillator 
for the BHD and the coupling pulses (24). As a 
result, we can perform sequential computations 
stably for a few seconds. The remaining 200 DOPOs 
are turned off (i.e., the pump pulses are turned off 
at the corresponding time slots) to separate the 
signal and training DOPO pulses. 
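
The time-slot budget described above can be checked with a few lines of arithmetic. This is a sketch under the assumption that every 1-ns slot in the cavity is counted; the round-trip time it implies is an inference from the slot count, not a value reported in the text beyond the rounded 5 μs.

```python
SIGNAL_DOPOS = 2048      # used as Ising spins
TRAINING_DOPOS = 2834    # kept oscillating to provide phase-locking error signals
BLANK_SLOTS = 200        # pump turned off to separate the two groups
PULSE_INTERVAL_S = 1e-9  # pump pulse interval quoted in the text

total_slots = SIGNAL_DOPOS + TRAINING_DOPOS + BLANK_SLOTS
implied_round_trip_s = total_slots * PULSE_INTERVAL_S


def circulations(duration_s, round_trip_s):
    """Number of cavity round trips that fit into a given time window."""
    return duration_s / round_trip_s


round_trips_in_5_ms = circulations(5e-3, implied_round_trip_s)
```

The three groups add up to the 5082 slots quoted in the text, which corresponds to a round trip of about 5.08 μs, and a 5-ms window then holds on the order of a thousand circulations.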

To perform a computation, we turn the signal 
DOPOs on and off by controlling the power of 
the pump pulses for the signal DOPOs in a 5-ms 
period (fig. S2A). During the computation, the in-phase
components $\{\tilde{c}_i\}$ of the signal DOPOs from
coupler 1 are measured using the BHD for every
circulation of the DOPOs in the cavity. The power
of the local oscillator for the BHD is 2.5 mW, with
which the shot noise power is ~8 dB larger than
the thermal noise power, and thus a shot noise-dominant
coherent measurement is achieved. The
measured $\{\tilde{c}_i\}$ is supplied to a field-programmable
gate array (FPGA) module. The given problem,
specified by the spin-spin interaction terms $\{J_{ij}\}$
(a 2048 × 2048 adjacency matrix), is input in advance
into the FPGA module. The current FPGA
module only accepts $J_{ij}$ values of {-1, 0, 1}. With
$\{\tilde{c}_i\}$ and $\{J_{ij}\}$, the FPGA module calculates the
feedback signal to the ith pulse, $f_i = -r \sum_j J_{ij} \tilde{c}_j$, for
every DOPO circulation, where $r$ denotes a constant
that determines the coupling strength and
was set at ~0.01 in the following experiments. A
push-pull modulator imposes the feedback signal
$f_i$ on the coupling pulses, which are extracted from
the pump pulses. Then, the modulated coupling
Fig. 2. DOPO time evolutions when solving the MAX-CUT problem for complete graph $K_{2000}$. (A) Histogram of 2000 signal DOPO time evolutions. (B) In-phase
component dynamics of the 50 sampled DOPOs. Inset shows the magnification of the area outlined by the red rectangle.
Fig. 3. Cut values obtained for the MAX-CUT problem on 2000-node graphs with the CIM, SA, and GW-SDP. Best known refers to the highest cut
value yet reported with a conventional algorithm.


pulses are launched into the cavity through
coupler 2 so that a coupling pulse that conveys $f_i$ is
injected into the ith DOPO. Note that this MFB
sequence is repeated throughout the entire computational
sequence. With the current FPGA module,
it takes about 60 s to transfer the problem
matrix and the computation result data be- 
tween the FPGA module and the computer, be- 
cause of the use of a slow serial communication 
interface (RS-232C) that is the only available 
port in the module. The data transfer time can 
be much shorter than the computation trial time 
if we make use of high-speed interfaces (24). 

We studied the computational accuracy of the 
CIM and compared it with SA run on a single core 
of an Intel Xeon X5650 (2.67 GHz) processor for 
2000-node graphs. With the CIM and SA, we solved 
maximum-cut (MAX-CUT) problems (24) for three 
types of 2000-node undirected graphs: G22, G39, 
and $K_{2000}$. G22 and G39 are sparsely connected
graphs taken from the G-set (27). G22 is a random 
graph with a 1% edge density, and G39 is a scale- 
free graph in which the node degree distribution 
follows power-law scaling. $K_{2000}$ is a fully connected
complete graph with 1,999,000 undirected edges, 
which is randomly weighted by {-1, +1}. These 
three instances were randomly constructed with 
a machine-independent graph generator (28). We 
also solved the same graphs using the semidefinite 
programming relaxation algorithm proposed by 
Goemans and Williamson (GW-SDP), because this 
algorithm guarantees to deliver solutions to MAX- 
CUT problems with an expected value of at least 
87.8% of the optimal value (29). We used the cut 
values obtained by GW-SDP as an index for the
computational accuracy. 
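
The link between a cut value and an Ising energy can be sketched as follows. The mapping used here, J_ij = -w_ij together with H = -sum over i<j of J_ij s_i s_j, is an assumption made for illustration (the source defers the details of the MAX-CUT formulation to its supplementary reference), and the four-node cycle is a hypothetical example.

```python
def cut_value(weights, spins):
    """Total weight of edges whose two endpoints carry opposite spins."""
    n = len(spins)
    return sum(weights[i][j]
               for i in range(n) for j in range(i + 1, n)
               if spins[i] != spins[j])


def ising_energy_for_cut(weights, spins):
    """Ising energy under the assumed mapping J_ij = -w_ij."""
    n = len(spins)
    return sum(weights[i][j] * spins[i] * spins[j]
               for i in range(n) for j in range(i + 1, n))


def cut_from_energy(total_weight, energy):
    """cut = (W - H) / 2, where W is the sum of all edge weights."""
    return (total_weight - energy) / 2


# Hypothetical 4-node cycle with unit weights: edges 0-1, 1-2, 2-3, 3-0.
w = [[0, 1, 0, 1],
     [1, 0, 1, 0],
     [0, 1, 0, 1],
     [1, 0, 1, 0]]
alternating = (1, -1, 1, -1)
total_weight = sum(w[i][j] for i in range(4) for j in range(i + 1, 4))
cut = cut_value(w, alternating)
energy = ising_energy_for_cut(w, alternating)
```

For the alternating assignment every edge is cut, so the cut is 4 and the energy is -4. Under this mapping a lower Ising energy always corresponds to a larger cut, which is why cut values can serve as an accuracy index.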

An example of the temporal evolution of the 
signal DOPO amplitudes observed while solving 
the $K_{2000}$ graph is shown in Fig. 2A, which portrays
the cooperative behavior of the DOPOs leading
to the final stable states after frequent changes
of their signs at the beginning of the computation. 
To show the detailed dynamics of each signal 
DOPO, we plot the measured in-phase components 
$\tilde{c}_i$ of 50 of the 2000 signal DOPOs in Fig. 2B. As
shown in the inset of Fig. 2B, we observed the 
evolution of the DOPO amplitudes searching 
for a lower-energy state almost immediately af- 
ter the pumping started; the dynamics was com- 
Fig. 4. Time evolutions of Ising energy obtained
with CIM (blue curve) and SA (black curve) when
solving the complete graph $K_{2000}$. The dotted line
corresponds to the target Ising energy of -60,278
obtained with GW-SDP.


plex, especially in the first 200 circulations. This 
result confirmed a reduction in the Ising energy 
(fig. S4). 

Using the CIM, we performed 100 sequential 
computations for MAX-CUT on the three graphs. 
The cut values obtained with the CIM and SA for 
each graph are summarized in Fig. 3. Here, the 
final DOPO states for each computation trial were 
stably recorded sequentially 100 times, thanks 
to the phase stabilization of the optical system. 
The SA result was obtained in 50 ms for each trial 
without the data read and write time. With GW- 
SDP, a computation trial took about 1 s. For all 
three graphs, the CIM achieved larger cut values 
than those obtained by GW-SDP, which confirmed 
that a physical system based on the CIM concept 
successfully worked to find solutions to the large- 
scale Ising problems with as many as 2000 spins. 
SA produced better values than the CIM when 
solving sparse graphs G22 and G39, as confirmed 
by the cut histograms in Fig. 3. In contrast, the 
cut obtained by the CIM for the $K_{2000}$ graph exceeded
that obtained by SA with one-tenth of the
computation time needed for SA. 

We then compared the computation time 
needed to find an approximate solution for the 
MAX-CUT of the $K_{2000}$ graph. Because the exact
solution of this problem is not known, we cannot 
evaluate the computation time to find the optimal 
solution. Hence, we measured the computation 
times needed with the CIM and SA to reach the 
target Ising energy obtained with GW-SDP, namely 
-60,278 (or 29,619 cuts). The temporal evolution 
curves for the CIM and SA are shown in Fig. 4 by 
blue and black curves, respectively. Here, we performed
the computation 100 times; the curves
indicating the shortest times needed to reach the 
target energy are shown for both the CIM and 
SA. The times needed to reach the target Ising 
energy were about 70 μs for the CIM and 3.2 ms
for SA, implying that the CIM reached the
benchmark Ising energy faster than SA
by a factor of nearly 50. This result suggests
that a CIM can outperform SA on a CPU
when solving dense graph problems. 
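
The figures in this comparison can be cross-checked with simple arithmetic. The speed-up follows directly from the two quoted times. The second part is a consistency check that assumes the relation cut = (W - H)/2 between cut value, total edge weight W, and Ising energy H, which holds if the couplings are the negated edge weights; that mapping is an assumption here, not something stated in this passage.

```python
import math

CIM_TIME_S = 70e-6   # time for the CIM to reach the target energy
SA_TIME_S = 3.2e-3   # time for SA to reach the target energy
speedup = SA_TIME_S / CIM_TIME_S

TARGET_ENERGY = -60278      # target Ising energy from GW-SDP
TARGET_CUT = 29619          # equivalent cut value quoted in the text
N_EDGES = 2000 * 1999 // 2  # edges of the complete 2000-node graph

# Under the assumed relation cut = (W - H) / 2, the two numbers fix W.
implied_total_weight = 2 * TARGET_CUT + TARGET_ENERGY

# Typical size of the sum of N_EDGES random weights drawn from {-1, +1}.
random_weight_sigma = math.sqrt(N_EDGES)
```

The ratio of the two times is about 46, consistent with "nearly 50". The implied total weight of -1040 lies within one standard deviation (about 1414) of zero, which is what one would expect for roughly two million random weights of -1 and +1.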

According to the numerical study of Maruo et al. 
(14), the formation of a superposed state is ob- 
served when an optically coupled CIM is operated 
in the shot noise-limited regime. This suggests 
that the CIM may exhibit quantum behavior, 
although experimental confirmation is yet to be 
achieved. A more detailed discussion on the quan- 
tumness of the CIM is provided in (24). 
QUANTUM GASES


Universal space-time scaling 
symmetry in the dynamics of bosons 
across a quantum phase transition 


Logan W. Clark,* Lei Feng, Cheng Chin 


The dynamics of many-body systems spanning condensed matter, cosmology, and beyond 
are hypothesized to be universal when the systems cross continuous phase transitions. 
The universal dynamics are expected to satisfy a scaling symmetry of space and time 
with the crossing rate, inspired by the Kibble-Zurek mechanism. We test this symmetry 
based on Bose condensates in a shaken optical lattice. Shaking the lattice drives 
condensates across an effectively ferromagnetic quantum phase transition. After crossing 
the critical point, the condensates manifest delayed growth of spin fluctuations and 
develop antiferromagnetic spatial correlations resulting from the sub-Poisson distribution 
of the spacing between topological defects. The fluctuations and correlations are 
invariant in scaled space-time coordinates, in support of the scaling symmetry of quantum 


critical dynamics. 


Critical phenomena near a continuous phase
transition reveal fascinating connections between
seemingly disparate systems that can
be described via the same universal principles.
Such systems can be found in the contexts
of superfluid helium (1), liquid crystals (2),
biological cell membranes (3), the early universe 
(4), and cold atoms (5, 6). An important universal 
prediction is the power-law scaling of the to- 
pological defect density with the rate of cross- 
ing a critical point, as first discussed by T. Kibble 
in cosmology (4) and extended by W. Zurek in 
the context of condensed matter (1). Their theory,
known as the Kibble-Zurek mechanism, has
been the subject of intense experimental study 
that has largely supported the scaling laws (7). 
Recent theoretical works further propose the 
so-called universality hypothesis, according to 
which the collective dynamics across a critical 
point should be invariant in the space and time 
coordinates that scale with the Kibble-Zurek 
power law (8-10). 

Atomic quantum gases provide a clean, well-characterized,
and controlled platform for studying
critical dynamics (6, 11, 12). They have enabled
experiments on the formation of topological defects 
across the Bose-Einstein condensation transition
(13-16) as well as critical dynamics across quan- 
tum phase transitions (17-23). Recent experiments 
using cold atoms in shaken optical lattices (24-26) 
have provided a vehicle for exploring phase tran- 
sitions in spin models (27-29). 

Here we study the critical dynamics of Bose 
condensates in a shaken optical lattice near an 
effectively ferromagnetic quantum phase tran- 
sition. The transition occurs when we ramp the 
shaking amplitude across a critical value, causing 
the atomic population to bifurcate into two pseudo- 
spinor ground states (28). We measure the growth 
of spin fluctuations and the spatial spin correla- 
tions for ramping rates varied over two orders of 
magnitude. Beyond the critical point, we observe 
delayed development of ferromagnetic spin do- 
mains with long-range antiferromagnetic corre- 
lations due to the bunching of the domain sizes, 
which is not expected in a thermal distribu- 
tion of ferromagnets. The times and lengths 
characterizing the critical dynamics agree with 
the scaling predicted by the Kibble-Zurek mech- 
anism. The measured fluctuations and correla- 
tions collapse onto single curves in scaled space 
and time coordinates, supporting the universal- 
ity hypothesis. 

Our experiments use elongated three-dimensional 
(3D) Bose-Einstein condensates (BECs) of cesium 
atoms. We optically confine the condensates with 
trap frequencies of $(\omega_{x'}, \omega_{y'}, \omega_z) = 2\pi \times (12, 30, 70)$
Hz, where the long (x′) and short (y′) axes are
oriented at 45° with respect to the x and y coordinates
(Fig. 1A). The tight confinement along
the vertical z axis suppresses nontrivial dynamics 
in that direction (see the discussion on the dynam- 
ics in the y direction below), which is also the 
optical axis of our imaging system. We adiabati- 
cally load the condensates into a 1D optical lat- 
tice (17) along the x axis with a lattice spacing 
of 532 nm and a depth of 8.86 E_R, where
E_R = h × 1.33 kHz is the recoil energy and h is
Planck’s constant. 
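
The quoted recoil energy can be cross-checked from the lattice spacing alone. The sketch below assumes the standard definition E_R = h²/(8 m d²) for a lattice of spacing d (equivalently, lattice light of wavelength 2d) and assumes the atoms are cesium-133; the function name and the rounded mass value are illustrative choices, not taken from the text.

```python
H = 6.62607015e-34        # Planck constant in J s (exact SI value)
AMU = 1.66053906660e-27   # atomic mass unit in kg
M_CS = 132.905 * AMU      # approximate mass of cesium-133 (assumed isotope)


def recoil_frequency_hz(lattice_spacing_m, mass_kg):
    """Return E_R / h = h / (8 m d^2) for a lattice of spacing d."""
    return H / (8.0 * mass_kg * lattice_spacing_m ** 2)


# Lattice spacing of 532 nm, as stated in the text
f_recoil = recoil_frequency_hz(532e-9, M_CS)
print(f"E_R / h = {f_recoil / 1e3:.2f} kHz")
```

Under these assumptions the result is close to the 1.33 kHz given above, so the stated lattice depth of 8.86 E_R corresponds to roughly h × 11.7 kHz.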

To induce the ferromagnetic quantum phase 
transition, we modulate the phase of the lattice 
beam to periodically translate the lattice potential 
by Δx(t) = (s/2) sin(ωt), where s is the shaking
amplitude and the modulation frequency ω is
tuned to mix the ground and first excited lattice
bands (fig. S1) (28, 30). The hybridized single-particle
ground band energy ε can be modeled for
small quasimomentum q = (q_x, q_y, q_z) by


$$\epsilon(\mathbf{q}, s) = \alpha(s)\, q_x^2 + \beta(s)\, q_x^4 + \frac{q_y^2 + q_z^2}{2m} \qquad (1)$$


where m is the atomic mass, and the coefficients
of its quadratic (α) and quartic (β) terms depend
on the shaking amplitude (Fig. 1B). For shaking
amplitudes below the critical value, the coefficient
α is positive and the BEC occupies the lone ground
state at momentum q = 0. The quantum phase
transition occurs when the quadratic term crosses
zero at s = s_c, where α = 0 and β > 0. At this
point, the speed of sound for superfluid excitations,
formally studied in (31), drops to zero along
x but remains nearly constant along y and z. Even
stronger shaking converts the ground band into
a double well with α < 0, yielding two degenerate
ground states with q_x = ±q*. Repulsively interacting
bosons with this double-well ground band are
effectively ferromagnetic, having two degenerate
many-body ground states with all atoms either
pseudo-spin up (q_x = q*) or down (q_x = −q*) (28).
Notably, transitioning to one of these two ground 
states requires the system to spontaneously break 
the symmetry of its Hamiltonian. Describing the 
dynamics across the critical point presents a major 
challenge because of the divergence of the cor- 
relation length of quasimomentum and the relax- 
ation time (critical slowing). 

The Kibble-Zurek mechanism provides a power- 
ful insight into quantum critical dynamics. Accord- 
ing to this theory, when the time remaining to 
reach the critical point inevitably becomes shorter 
than the relaxation time, the system becomes effectively
frozen (Fig. 1C). The system only unfreezes
at a delay time t_KZ after passing the critical point,
when relaxation becomes faster than the ramp.
At this time, topological defects become visible,
and the typical distance d_KZ between neighboring
defects is proportional to the equilibrium
correlation length. The Kibble-Zurek mechanism
predicts that t_KZ and d_KZ depend on the quench
rate ṡ as


$$t_{\mathrm{KZ}} \propto \dot{s}^{-a}, \quad a = \frac{z\nu}{1 + z\nu} \qquad (2)$$

$$d_{\mathrm{KZ}} \propto \dot{s}^{-b}, \quad b = \frac{\nu}{1 + z\nu} \qquad (3)$$

where z and ν are the equilibrium dynamical and
correlation length exponents given by the univer- 
sality class of the phase transition. Although the 
details of this picture may not apply to every phase 
transition, the general scaling arguments are very 
robust, and similar predictions hold for a variety 
of quench types across the transition (12) and for 
phase transitions that break either continuous or 
discrete symmetries (7). 
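
As a brief numerical illustration of the Kibble-Zurek prediction, the sketch below evaluates the temporal and spatial scaling exponents, a = zν/(1 + zν) and b = ν/(1 + zν), for a given pair of equilibrium exponents. The function name is hypothetical, and the mean-field inputs z = 2 and ν = 1/2 are used only as an example.

```python
def kz_exponents(z, nu):
    """Return (a, b): t_KZ scales as rate**(-a), d_KZ as rate**(-b)."""
    denom = 1.0 + z * nu
    a = z * nu / denom   # temporal scaling exponent
    b = nu / denom       # spatial scaling exponent
    return a, b


# Mean-field example: z = 2, nu = 1/2
a_mf, b_mf = kz_exponents(2.0, 0.5)
print(a_mf, b_mf)
```

For the mean-field inputs this gives a = 0.5 and b = 0.25, so a hundredfold slower ramp would lengthen the characteristic time tenfold and the characteristic length by a factor of about 3.2.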

For slow ramps, t_KZ and d_KZ diverge and become
separated from other scales in the system,
making them the dominant scales for character- 
izing the collective critical dynamics (8-10). This 
idea motivates the universality hypothesis, which 
can be expressed as 


$$f(x, t; \dot{s}) \propto F\!\left(\frac{x}{d_{\mathrm{KZ}}}, \frac{t}{t_{\mathrm{KZ}}}\right) \qquad (4)$$

indicating that the critical dynamics of any col- 
lective observable f obeys the scaling symmetry 
and can be described by a universal function F of 
the scaled coordinates x/d_KZ and t/t_KZ. The only
effect of the quench rate is to modify the length 
and time scales. 

We test the scaling symmetry of time by mon- 
itoring the emergence of quasimomentum fluctu- 
ations at different quench rates. Here, fluctuations 
refer to deviations of quasimomentum from zero, 
which vary across space and between individual 
samples; fluctuations should saturate to a large 
value when domains having q = ±q* are fully
formed. After loading the condensates into the 
lattice, we ramp the shaking amplitude linearly
from s = 0 to values well above the critical amplitude
s_c = 13.1 nm (32) and interrupt the ramps
at various times to perform a brief time-of-flight
(TOF) before detection. After TOF, we measure
the density deviation δn(r) = n_{−1}(r) − n_1(r) −
⟨n_{−1}(r) − n_1(r)⟩ (32), which is nearly proportional
to the quasimomentum distribution (fig. S2),
where n_i(r) is the density profile of the ith Bragg
peak and the angle brackets denote averaging
over multiple images. This detection method is
particularly sensitive near the critical point when
the quasimomentum just starts deviating from
zero, indicating the emergence of fluctuations
in the ferromagnetic phase where the ground
states have nonzero quasimomentum. The spin
density measurement used later to study spatial
correlations is viable only when atoms settle
to q = ±q*.

Over a wide range of quench rates, the evolu- 
tion of quasimomentum fluctuation can be de- 
scribed in three phases (Fig. 2A). First, below the 
critical point, quasimomentum fluctuation does 
not exceed its baseline level. Second, just after 
passing the critical point, critical slowing keeps 
the system “frozen,” and fluctuation remains low. 
Finally, the system unfreezes and quasimomen- 
tum fluctuation quickly increases and saturates, 
indicating the emergence of ferromagnetic domains.
We quantify this progression by investigating
the fluctuation of contrast Δc = ⟨δn²/n²⟩
(Fig. 2B) that tracks quasimomentum fluctuation
in our condensates via the fluctuation of δn (fig.
S2), where n is the total density and the angle
brackets denote averaging over space and over 
multiple images. For comparison between differ- 
ent quench rates, we calculate the normalized 


fluctuation ΔC = (Δc − Δc_i)/(Δc_f − Δc_i), where

Fig. 1. Ferromagnetic quantum phase transition of bosons in a shaken optical lattice. (A) A BEC of 
cesium atoms (spheres) in a 1D optical lattice (pink surface) shaking with peak-to-peak amplitude s can 
form ferromagnetic domains (blue and red regions). The elliptical harmonic confinement has principal 
axes rotated 45° from the lattice. (B) The transition occurs when the ground band evolves from quadratic
for s < s_c [paramagnetic (PM) phase], through quartic at the quantum critical point s = s_c, to a
double well for s > s_c [ferromagnetic (FM) phase] with two minima at q_x = ±q* (28). (C) Kibble-Zurek
picture. Evolution of the condensate crossing the phase transition becomes diabatic in the frozen regime 
(cyan) when the time t remaining to reach the critical point is less than the relaxation time. Faster ramps 
cause freezing farther from the critical point, limiting the system to smaller domains. Sample domain 
images are shown for slow, medium, and fast ramps.

subscripts i and f indicate the fluctuation at early
and late times, respectively (32). We find empiri- 
cally that the growth of normalized fluctuations 
is well fit by the function 


$$\Delta C(t) = \frac{1}{2}\left[1 + \tanh\!\left(\frac{t - t_d}{t_f}\right)\right] \qquad (5)$$


where the time t is defined relative to when the
system crosses the critical point at t = 0, t_d characterizes
the delay time when the system unfreezes,
and t_f is the formation time over which
the fluctuation grows. The measurement of fluctuation
over time provides a critical test for both
the Kibble-Zurek scaling and the universality hypothesis.
First, both t_d and t_f exhibit clear power-law
scaling with the quench rate ṡ varied over more
than two orders of magnitude (Fig. 2C). Power-law
fits yield the exponents of a_d = 0.50(2) and a_f =
0.50(6), respectively. The nearly equal exponents
are suggestive of the universality hypothesis, which
requires all times to scale identically. Indeed, the
growth of contrast fluctuation ΔC follows a universal
curve when time is scaled by t_d (Fig. 2D),
strongly supporting the universality hypothesis
(Eq. 4). Note that any observable time characterizing
the collective dynamics can be chosen as t_KZ in
Eq. 4, including t_d and t_f.
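
The two analysis steps described above can be sketched in a few lines. The example assumes the empirical growth function has the form ΔC(t) = [1 + tanh((t − t_d)/t_f)]/2 and extracts a power-law exponent by a least-squares line fit in log-log coordinates. The delay times below are synthetic values generated from an exact power law purely for illustration; they are not the measured data, and the function names are hypothetical.

```python
import math


def normalized_fluctuation(t, t_d, t_f):
    """Assumed empirical form: rises from 0 to 1 and equals 0.5 at t = t_d."""
    return 0.5 * (1.0 + math.tanh((t - t_d) / t_f))


def power_law_exponent(rates, times):
    """Least-squares slope of log(time) versus log(rate).

    Returns a such that time is proportional to rate**(-a).
    """
    xs = [math.log(r) for r in rates]
    ys = [math.log(t) for t in times]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return -cov / var


# Synthetic delay times (arbitrary units) following t_d = 10 * rate**(-0.5)
rates = [0.06, 0.23, 0.91, 3.6]          # quench rates in nm/ms
delays = [10.0 * r ** -0.5 for r in rates]
a_d = power_law_exponent(rates, delays)
print(a_d, normalized_fluctuation(delays[0], delays[0], 1.0))
```

Because the synthetic delays follow an exact power law, the fit returns the input exponent; with real data the scatter of the points would set the uncertainty quoted in parentheses.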

We next test the spatial scaling symmetry based 
on the structures of pseudo-spin domains that 
emerge after the system unfreezes. Here, we cross 
the critical point with two different protocols: The 
first is a linear ramp starting from s = 0, whereas 
the second begins with a jump to s = s_c, followed
by a linear ramp. We detect domains near the
time t = 1.4 t_d in the spin density distribution
j_z(r) = n_+(r) − n_−(r) based on the density n_{+/−}
of atoms with spin up/down (fig. S3). At this time, 
the spin domains are fully formed and clearly sep- 
arated by topological defects (domain walls), as 
shown in Fig. 3A. Furthermore, choosing this time 
just after domain formation minimizes the time 
available for nonuniversal relaxation processes. 
We characterize the domain distribution with the 
spin correlation function (17, 28) 


$$G(\mathbf{r}) = \left\langle \int j_z(\mathbf{R} + \mathbf{r})\, j_z(\mathbf{R})\, d\mathbf{R} \right\rangle \qquad (6)$$


averaged over multiple images (Fig. 3B). Both 
ramping protocols lead to similar correlation 
functions, suggesting that the domain distribu- 
tion is insensitive to increases in the quench rate 
below the critical point. 

The spin correlations reveal rich domain struc- 
ture that strongly depends on the quench rate. For 
slower ramps ṡ < 1.3 nm/ms, the structures are
predominantly one-dimensional and the density of 
topological defects increases with the quench rate. 
The tighter confinement and finite speed of sound 
near the critical point along the y and z axes allow 
spin correlations to span the gas in those direc- 
tions. The dynamics thus appear one-dimensional. 
When the quench rate exceeds 1.3 nm/ms, defects 
start appearing along the y axis, and the domain 
structures become multidimensional. We attribute
this dimensional crossover to the unfreezing

Fig. 2. Growth of quasimomentum fluctuation in quantum critical dynamics. (A) Sample images
show the emergence of nonzero local quasimomentum via the density deviation between Bragg peaks δn
as the system is linearly ramped across the ferromagnetic phase transition; four ramps with increasing
quench rates (bottom to top) are shown. Each ramp exhibits three regimes: a subcritical regime before
the transition; a frozen regime beyond the critical point where the fluctuation remains low; and a growth
regime in which fluctuation increases and saturates, indicating domain formation. Time t = 0 corresponds
to the moment when the system reaches the critical point. (B) Quasimomentum fluctuation for ramp
rates ṡ = 3.6 (triangles), 0.91 (squares), 0.23 (diamonds), and 0.06 nm/ms (circles) arises at a delay time
t = t_d over a formation time t_f. Fluctuation is normalized for each ramp rate to aid comparison (32). The
solid curves show fits based on Eq. 5. (C) The dependence of t_d (circles) and t_f (squares) on the quench
rate is well fit by power laws (solid curves) with scaling exponents of a_d = 0.50(2) and a_f = 0.50(6),
respectively. The inset shows t_d on a linear scale. (D) Fluctuations measured for 16 ramp rates from 0.06 to
10.3 nm/ms collapse to a single curve when time is scaled by t_d based on the power-law fit. The solid curve
shows the best fit based on the empirical function (Eq. 5), and the gray shaded region covers one standard
deviation. Error bars in (B) and (C) indicate one standard error.


time becoming too short to establish correla- 
tion along the y axis. For the remainder of this 
work, we focus on the slower quenches and investigate
the spin correlations along the x axis.
We examine the 1D correlations using line 
cuts of the density-weighted correlation functions 
g(r) = G(r)/⟨∫ n(R + r) n(R) dR⟩ (17, 28). The
results exhibit prominent decaying oscillation (Fig.
3C). We extract two essential length scales from
the correlation functions: the average domain size
d, or equivalently, the distance between neighboring
topological defects, and the correlation length
ξ, indicating the width of the envelope function.
These two scales are determined from the position
and width of the peak in the Fourier transform of
g(x) (32).

These length scales enable us to test the spatial 
scaling symmetry. The lengths d and ξ both display
power-law scaling consistent with the Kibble-Zurek
mechanism (Fig. 3D), with fits yielding
exponents b_d = 0.26(2) for the domain size and
b_ξ = 0.26(5) for the correlation length. Similarly,
the correlations, measured at the same scaled time, 
collapse to a single curve in spatial coordinates 
scaled by the domain size d (Fig. 3E). This result 
strongly supports the spatiotemporal scaling from

Fig. 3. Antiferromagnetic spatial correlations from quantum critical
dynamics. (A) Two sample images at each quench rate exemplify spin domains
measured near the time t = 1.4 t_d after crossing the phase transition. These
images correspond to linear ramps starting at s = 0 (ṡ = 0.32 to 5.12 nm/ms)
and s = s_c (ṡ = 0.04 to 0.16 nm/ms). (B) Spin correlation functions G_0(r) =
G(r)/G(0) (Eq. 6) are calculated from ensembles of 110 to 200 images. (C) Cuts
across the density-weighted correlation functions g_0(x) = g(x)/g(0) are shown
for quench rates ṡ = 1.28 (triangles), 0.45 (squares), 0.16 (diamonds), and
0.056 nm/ms (circles). Solid curves interpolate the data to guide the eye. The
typical domain size d and the correlation length ξ are illustrated for 0.056 nm/ms
by the arrow and dashed envelope, respectively (32). (D) The dependence of
d (green) and ξ (black) on the quench rate is well fit by power laws (Eq. 3) with
spatial scaling exponents of b_d = 0.26(2) and b_ξ = 0.26(5), respectively. Marker
shape indicates linear ramps starting at s = 0 (squares) or at s = s_c (circles). The
inset shows the results on a linear scale. Error bars indicate 1 SE. (E) Correlation
functions for ṡ = 0.04 to 1.28 nm/ms collapse to a single curve when distance is
scaled by the domain size extracted from the power-law fit in (D). The solid curve
shows the fit based on Eq. 7; the gray shaded area covers 1 SD. (F) The temporal
scaling exponents a_d and a_f from Fig. 2C (magenta) and the spatial scaling exponents
b_d and b_ξ from (D) (green) constrain the critical exponents ν and z according
to Eqs. 2 and 3 with 68% (dark) and 95% (light) confidence intervals. The
cross marks the best values with contours of 68 and 95% overall confidence (32).


the universality hypothesis (Eq. 4). An empirical
curve

$$g_0(x) = \exp\!\left[-\frac{1}{2\sigma^2}\left(\frac{x}{d}\right)^2\right] \cos\!\left(\frac{\pi x}{\gamma d}\right) \qquad (7)$$


provides a good fit to the universal correlation 
function, yielding σ = 1.01(1) and γ = 1.04(1), indicating
that the width of the envelope is close to
the typical domain size. 
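
To make the shape of the universal correlation function concrete, the sketch below evaluates it at a few scaled distances. It assumes the empirical curve is a Gaussian envelope of width σd multiplying a cosine of half-period γd, with the fitted values σ = 1.01 and γ = 1.04 as defaults; the function name is hypothetical.

```python
import math


def universal_correlation(x_over_d, sigma=1.01, gamma=1.04):
    """Assumed form: exp(-(x/d)^2 / (2 sigma^2)) * cos(pi (x/d) / gamma)."""
    envelope = math.exp(-x_over_d ** 2 / (2.0 * sigma ** 2))
    return envelope * math.cos(math.pi * x_over_d / gamma)


g_at_zero = universal_correlation(0.0)
g_at_d = universal_correlation(1.0)    # one domain size away
g_at_2d = universal_correlation(2.0)   # two domain sizes away
print(g_at_zero, g_at_d, g_at_2d)
```

With these parameters the correlation at x = d is strongly negative (about −0.6), which is the anticorrelation between neighboring domains, and it is weakly positive again at x = 2d.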

The most striking feature of the universal cor- 
relation function is the emergence of oscillatory, 
antiferromagnetic order in the ferromagnetic 
phase. In thermal equilibrium, ferromagnets are 
expected to have a finite correlation length but 
no anticorrelation. The appearance of strong anticorrelation
at x = d suggests that domains of size
d form preferentially during the quantum critical
dynamics. A statistical analysis of the topological
defect distribution reveals that the domain sizes
are bunched with their standard deviation σ_d =
0.31(2)d well below their mean, indicating that
the topological defects are created by a sub-Poisson
process (fig. S4).

Finally, the combined scaling exponents of space
and time allow us to extract the equilibrium critical
exponents based on the Kibble-Zurek mechanism
(33) (Fig. 3F). Solving Eqs. 2 and 3, we obtain the
dynamical exponent z = 1.9(2) and correlation
length exponent ν = 0.52(5), which are close to the
mean-field values z = 2 and ν = 1/2 up to our experimental
uncertainty. Note that the dynamical
critical exponent z = 2 results from the unique
quartic kinetic energy ε = βq_x⁴ of our system at the
critical point (32).
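
The inversion of the scaling relations can be illustrated with central values only. Assuming a = zν/(1 + zν) and b = ν/(1 + zν), dividing the two gives z = a/b, and 1 − a = 1/(1 + zν) gives ν = b/(1 − a). The sketch below applies this to a = 0.50 and b = 0.26; it ignores the uncertainties and the combined fit of all four measured exponents described in (32), and the function name is hypothetical.

```python
def critical_exponents(a, b):
    """Invert a = z*nu/(1 + z*nu), b = nu/(1 + z*nu) for (z, nu)."""
    z = a / b
    nu = b / (1.0 - a)
    return z, nu


# Central values of the measured temporal and spatial exponents
z_est, nu_est = critical_exponents(0.50, 0.26)
print(f"z = {z_est:.2f}, nu = {nu_est:.2f}")
```

These central values land near z ≈ 1.9 and ν ≈ 0.52, in line with the quoted results and close to the mean-field pair z = 2, ν = 1/2.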

Direct identification of domain walls presents 
intriguing possibilities for future studies of 
the topological defects generated during critical 
dynamics. These opportunities would be par- 
ticularly interesting if the shaking technique 
were extended to higher dimensions in such 
a way that the transition breaks a continuous 
symmetry. In addition, the scaling of the cor- 
relation functions suggests that the antiferro- 
magnetic order may be a shared feature of 
quantum critical dynamics for phase transitions 
in the same universality class, meriting future 
experiments. 


REFERENCES AND NOTES 


1. W. H. Zurek, Nature 317, 505-508 (1985).
2. I. Chuang, R. Durrer, N. Turok, B. Yurke, Science 251,
1336-1342 (1991).
3. S. L. Veatch, O. Soubias, S. L. Keller, K. Gawrisch, Proc. Natl.
Acad. Sci. U.S.A. 104, 17650-17655 (2007).
4. T. W. B. Kibble, J. Phys. Math. Gen. 9, 1387-1398 (1976).
5. Q. Zhou, T. L. Ho, Phys. Rev. Lett. 105, 245702 (2010).
6. A. Polkovnikov, K. Sengupta, A. Silva, M. Vengalattore, Rev.
Mod. Phys. 83, 863-883 (2011).
7. A. del Campo, W. H. Zurek, Int. J. Mod. Phys. A 29, 1430018
(2014).
8. M. Kolodrubetz, B. K. Clark, D. A. Huse, Phys. Rev. Lett. 109,
015701 (2012).
9. A. Chandran, A. Erez, S. S. Gubser, S. L. Sondhi, Phys. Rev. B
86, 064304 (2012).
10. A. Francuz, J. Dziarmaga, B. Gardas, W. H. Zurek, Phys. Rev. B
93, 075134 (2016).
11. I. Bloch, J. Dalibard, W. Zwerger, Rev. Mod. Phys. 80, 885-964
(2008).
12. J. Dziarmaga, Adv. Phys. 59, 1063-1189 (2010).
13. C. N. Weiler et al., Nature 455, 948-951 (2008).
14. G. Lamporesi, S. Donadello, S. Serafini, F. Dalfovo, G. Ferrari,
Nat. Phys. 9, 656-660 (2013).
15. L. Corman et al., Phys. Rev. Lett. 113, 135302 (2014).
16. N. Navon, A. L. Gaunt, R. P. Smith, Z. Hadzibabic, Science 347,
167-170 (2015).
17. L. E. Sadler, J. M. Higbie, S. R. Leslie, M. Vengalattore,
D. M. Stamper-Kurn, Nature 443, 312-315 (2006).


4 NOVEMBER 2016 * VOL 354 ISSUE 6312 609 




RESEARCH | REPORTS 


18. K. Baumann, R. Mottl, F. Brennecke, T. Esslinger, Phys. Rev. 
Lett. 107, 140402 (2011). 

19. D. Chen, M. White, C. Borries, B. DeMarco, Phys. Rev. Lett. 106, 
235304 (2011). 

20. E. Nicklas et al., Phys. Rev. Lett. 115, 245301 (2015). 

21. S. Braun et al., Proc. Natl. Acad. Sci. U.S.A. 112, 3641-3646 
(2015). 

22. C. Meldgin et al., Nat. Phys. 12, 646-649 (2016). 

23. M. Anquez et al., Phys. Rev. Lett. 116, 155301 (2016). 

24. N. Gemelke, E. Sarajlic, Y. Bidel, S. Hong, S. Chu, Phys. Rev. 
Lett. 95, 170404 (2005). 

25. H. Lignier et al., Phys. Rev. Lett. 99, 220403 (2007). 

26. J. Struck et al., Phys. Rev. Lett. 108, 225304 (2012). 

27. J. Struck et al., Science 333, 996-999 (2011). 


28. C. V. Parker, L.-C. Ha, C. Chin, Nat. Phys. 9, 769-774 
(2013). 

29. G. Jotzu et al., Nature 515, 237-240 (2014). 

30. W. Zheng, B. Liu, J. Miao, C. Chin, H. Zhai, Phys. Rev. Lett. 113, 
155303 (2014). 

31. L.-C. Ha, L. W. Clark, C. V. Parker, B. M. Anderson, C. Chin, 
Phys. Rev. Lett. 114, 055301 (2015). 

32. Materials and methods are available on Science Online. 

33. J. Xu et al., Europhys. Lett. 113, 50003 (2016). 




ACKNOWLEDGMENTS 


We thank L.-C. Ha, C. V. Parker, B. M. Anderson, B. J. DeSalvo, 
A. Polkovnikov, S. Sachdev, and Q. Zhou for helpful discussions. 
L.W.C. was supported by a National Defense Science and 


Engineering Graduate Fellowship and a Nambu Fellowship. This 
work was supported by NSF Materials Research Science and 
Engineering Centers (DMR-1420709), NSF grant PHY-1511696, and 
Army Research Office-Multidisciplinary Research Initiative grant 
W911NF-14-1-0003. The data presented in this paper are available
upon request to C.C. (cchin@uchicago.edu). 


SUPPLEMENTARY MATERIALS 
www.sciencemag.org/content/354/6312/606/suppl/DC1 
Materials and Methods 

Figs. S1 to S4 

References (34, 35) 


28 April 2016; accepted 26 September 2016 
10.1126/science.aaf9657 


PHYSICS 


Buffer-gas cooling of antiprotonic 
helium to 1.5 to 1.7 K, and 
antiproton-to-electron mass ratio 


Masaki Hori,* Hossein Aghai-Khozani, Anna Sótér, Daniel Barna, Andreas Dax,
Ryugo Hayano, Takumi Kobayashi, Yohei Murakami, Koichi Todoroki,
Hiroyuki Yamada, Dezső Horváth, Luca Venturelli


Charge, parity, and time reversal (CPT) symmetry implies that a particle and its
antiparticle have the same mass. The antiproton-to-electron mass ratio M_p̄/m_e can be
precisely determined from the single-photon transition frequencies of antiprotonic helium.
We measured 13 such frequencies with laser spectroscopy to a fractional precision of
2.5 × 10⁻⁹ to 16 × 10⁻⁹. About 2 × 10⁹ antiprotonic helium atoms were cooled to
temperatures between 1.5 and 1.7 kelvin by using buffer-gas cooling in cryogenic low-pressure
helium gas; the narrow thermal distribution led to the observation of sharp
spectral lines of small thermal Doppler width. The deviation between the experimental
frequencies and the results of three-body quantum electrodynamics calculations was
reduced by a factor of 1.4 to 10 compared with previous single-photon experiments. From
this, M_p̄/m_e was determined as 1836.1526734(15), which agrees with a recent proton-to-electron
experimental value within 8 × 10⁻¹⁰.


Techniques to cool atoms containing antiparticles
such as antihydrogen (1, 2), positronium
(3, 4), or antiprotonic helium (p̄He⁺)
(5, 6) allow their spectral lines to be mea- 
sured with narrow thermal Doppler widths 
and strong signal intensities. Their transition fre- 
quencies can then be determined with the highest 
precision, thus increasing the sensitivity of the 
experiments to the consistency of charge, parity, 
and time reversal (CPT) symmetry (7), which implies
that particles and antiparticles have equal
masses and absolute values of the charge. Spectral
data on p̄He⁺ have also been used to set
upper limits on a possible fifth force at angstrom-length
scales (8, 9). Buffer-gas cooling techniques
(10, 11) have been used to cool many ordinary
atoms and molecules that lack the closed-cycle
transitions suitable for laser cooling. The method relies on
elastic collisions of the atom of interest
with a cryogenic buffer gas to allow the system to
reach thermal equilibrium. Here, we report on the
application of this method to cool large (2 × 10⁹)
numbers of p̄⁴He⁺ and p̄³He⁺ isotopes to temperatures
T = 1.5 to 1.7 K.

Metastable p̄He⁺ is a three-body exotic atom
(fig. S1) that contains a helium nucleus, an electron
occupying the 1s state, and an antiproton in a
Rydberg state of principal and angular-momentum
quantum numbers n ~ ℓ + 1 ~ 38 (5, 12-15). An
antiproton occupying one of these circular orbitals,
which lie ~80 keV above the 1s antiprotonic state,
retains a microsecond-scale lifetime against annihilation
in the nucleus because it is localized
away from the nucleus. Before this experiment, it
was not known whether collisions with normal-matter
atoms would cool p̄He⁺ to such low (1.5 K)
temperatures because the corresponding multibody
calculations were complicated. In fact, other
exotic atoms such as pionic hydrogen (16) were
heated by collisions that deexcite the atom; p̄He⁺
atoms colliding with other helium atoms annihilated
(5) the antiprotons occupying some states.
We adjusted the density of the buffer helium gas
so that the p̄He⁺ atoms, once formed, rapidly
underwent a few hundred or more cooling collisions
before being interrogated by the laser
beam. The 1s electron protected most of the
p̄He⁺ atoms during this cooling.

The p̄He⁺ transition frequencies have recently
been calculated (12-15) to a precision of ~10⁻¹⁰ by
evaluating the complete set of quantum electrodynamics
(QED) corrections up to order m_e α⁷ in
atomic units. These a priori calculations used the
International Council for Science Committee on
Data for Science and Technology (CODATA) 2010-recommended
values of the fundamental constants
(17), including the fine structure constant
α, the ³He- and ⁴He-to-electron mass ratios, the
Bohr radius, and the Rydberg constant. By comparing
the calculated and experimental p̄He⁺ frequencies,
the M_p̄/m_e ratio can in principle be
determined to a fractional precision of <1 x 10° 
(12-15), which may rival the best measurements 
of the proton-to-electron mass ratio M_p/m_e (18-22).
Previous measurements (23) of the p̄He⁺ single-photon
frequencies, however, differed by up to
8 x 10°° from the calculations available at the 
time, which corresponded in some cases to 2 to 
3 standard experimental deviations. A major 
source of experimental uncertainty was the Dopp- 
ler broadening of the spectral lines. We later 
measured a different set of two-photon resonances 
using a Doppler-cancellation technique (6) in 
which p̄He⁺ atoms were irradiated by a pair of
counterpropagating laser beams. Three transi- 
tion frequencies involving the n < 36 states were 
measured with this complicated method. Assum- 
ing CPT invariance, the results were included 
in the CODATA 2010 data set in order to determine
the electron mass m_e in atomic mass
units; however, further improvements in the
precision of the determination of M_p̄/m_e were
limited by the poor experimental precision of
the single-photon frequencies (23) that had
higher sensitivities κ_M (5) to M_p̄/m_e and m_e, as
compared with those of the two-photon lines
(Table 1). The Doppler-broadened single-photon
lines of buffer-gas-cooled p̄He⁺ observed in the
present work, together with improved lasers,
detectors, and a hybrid magnetic and electrostatic
beamline, yielded M_p̄/m_e and m_e values
with a precision beyond that of the previous sub-Doppler
two-photon experiment (6).
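
The benefit of cooling can be estimated with the textbook Gaussian Doppler width, FWHM = ν·sqrt(8 ln2 · k_B T / (m c²)). The sketch below is an order-of-magnitude illustration only: the atomic mass of about 5.01 u for antiprotonic helium-4 and the optical frequency of 500 THz are assumed round values, and the function name is hypothetical.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant in J/K (exact SI value)
C = 299792458.0           # speed of light in m/s (exact SI value)
AMU = 1.66053906660e-27   # atomic mass unit in kg


def doppler_fwhm_hz(frequency_hz, temperature_k, mass_kg):
    """Full width at half maximum of a thermally Doppler-broadened line."""
    ratio = 8.0 * math.log(2.0) * K_B * temperature_k / (mass_kg * C ** 2)
    return frequency_hz * math.sqrt(ratio)


MASS = 5.01 * AMU   # assumed: helium-4 nucleus + antiproton + electron
fwhm_cold = doppler_fwhm_hz(500e12, 1.5, MASS)   # buffer-gas-cooled atoms
fwhm_warm = doppler_fwhm_hz(500e12, 10.0, MASS)  # warmer gas, for comparison
print(f"{fwhm_cold / 1e6:.0f} MHz at 1.5 K, {fwhm_warm / 1e6:.0f} MHz at 10 K")
```

Under these assumptions the width at 1.5 K is about 0.2 GHz. Because the width scales as the square root of temperature, a gas at 10 K would give lines roughly 2.6 times broader.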

In our experiment, p̄He⁺ atoms were formed
by allowing antiprotons to come to rest in a
helium buffer gas in thermal contact with an
open-cycle Joule-Thomson cryocooler at T = 1.3 K
(Fig. 1). About 3% of the antiprotons replaced
one of the 1s electrons of helium and formed
metastable p̄He⁺. Targets of low density were
used to reduce the effects of collisional shifting of 
the spectral lines; for this, antiproton beams of 
sufficiently low energy and emittance that could 
be made to come to rest in the targets were needed. 
The experiments were carried out at the Antiproton
Decelerator of the European Organization
for Nuclear Research (CERN), which provided
a pulsed beam containing 2 × 10⁷ to 3 × 10⁷
antiprotons of kinetic energy E = 5.3 MeV and
repetition rate 0.01 Hz. About 25% of these
were decelerated to E = 75 keV by allowing them
to traverse a radiofrequency quadrupole decelerator.
The emerging beam had a large emittance
ε = 50 to 100 π mm mrad and an energy spread
of ΔE = 10 keV. To transport this to the position
of the experimental target while preventing the
growth of beam halos, we constructed a 2-m-long
achromatic beamline (24). This reduced the beam
diameter (<15 mm) in the target by a factor of 2
compared with previous experiments (6, 23).
Each arrival of an antiproton pulse synthesized 
10° pHe* atoms. Some 3 x 10? to 15 x 10°atoms 
populated the resonance parent state at the 
moment of laser irradiation, depending on the 
transition. Data from >1.5 × 10⁹ cold p̄⁴He⁺
atoms and >5 × 10⁸ cold p̄³He⁺ atoms were collected
over a 3-year period.

The target chamber was machined out of a 
150 by 150 by 150 mm block of oxygen-free high- 
conductivity copper (Fig. 1). The antiprotons entered
through a 1.4-μm-thick, 28-mm-diameter
window made of biaxially oriented polyethylene
terephthalate (BoPET) supported by a grid of
gold-coated tungsten wires. Target ⁴He gas of
purity >99.999% introduced into the chamber
condensed into a superfluid film on the inner
walls cooled to a constant temperature of T = 1.3 K;
cold gas that evaporated from this was allowed
to collide with the antiprotons. No such film
formed for the ³He, or when the pressure of ⁴He
in the target (measured by capacitance manometers)
was low. By controlling the amount of gas
introduced into the sealed chamber, the pressure
was stabilized at several settings between P = 40
and 170 Pa, corresponding to atomic target densities
between ρ = 2 × 10¹⁸ and 9 × 10¹⁸ cm⁻³. A
30-mm-diameter laser beam entered through a
fused silica window with a major diameter of
70 mm, which was fixed on one side of the cham- 
ber at Brewster’s angle. The laser beam inter- 
sected the antiproton beam at a 90° angle and 
exited through a second window on the opposite 
side. This prevented any retroreflected laser light 
from irradiating the p̄He⁺ atoms and deforming
the spectral line shape, which was a problem in
the previous single-photon experiment (23) based
on collinear antiproton and laser beams. Three
1-m-long copper baffles cooled to T < 7 K reduced
the thermal radiation entering through the large
windows. Within the cryocooler, liquid helium
was cooled to T ~ 2.2 K in a recuperative heat
exchanger, before being expanded through a
throttle valve, which provided further cooling to
T ~ 1.3 K. The coolant condensed in a heat exchanger
that was evacuated through a 150-mm-diameter
cryogenic pumping line by means of a
Roots blower and rotary vane pump with respective
speeds of 2000 and 800 m³/hour. A 1-m-thick
concrete wall isolated the target from the
80 dB acoustic noise of the pumps. The temper- 
ature was stabilized to within 0.03 K by control- 
ling this throttle valve using a stepping motor. 
Dye and Ti:sapphire laser systems generated 
the 40- to 100-ns-long laser pulses with peak 
powers P = 0.5 to 10 kW and wavelengths λ = 264
to 841 nm needed to excite the p̄He⁺ atoms
within their microsecond-scale lifetimes. Their 


Fig. 1. Schematics of the experiment. (Top) Drawing of
the experimental setup used to synthesize antiprotonic
helium atoms and cool them with buffer-gas cooling to
T = 1.5 K. (Bottom) Drawing of the laser systems used
for spectroscopy. AOM, acousto-optic modulator; DPSS,
diode-pumped solid state; EOM, electro-optic modulator;
KTP, potassium titanyl phosphate; LBO, lithium triborate;
BBO, β-barium borate; PD, photodiode.

Fig. 2. Resonance profiles of the single-photon transitions. (A) (n, ℓ) =
(37, 35) → (38, 34), (B) (39, 35) → (38, 34), and (C) (38, 35) → (39, 34) of buffer-gas-cooled
p̄⁴He⁺ atoms. The narrow, intense spectral lines are compatible with the
Doppler widths of atoms cooled to T = 1.5 to 1.7 K, relative to the corresponding
transition frequencies between ν = 356 and 502 THz and when the Auger widths
of the resonance daughter states are subtracted. The arrows indicate the theoretical
positions of the hyperfine lines caused by the spin-spin interaction between
the antiproton and electron (26); the (B) and (C) splittings had not been observed
previously. (D) The profile (36, 34) → (37, 33) of cooled p̄³He⁺ shows a further
splitting, due to the ³He nuclear spin. The x-abscissa indicates the offset of the
optical frequency of the laser relative to the resonance centroid. Blue curves indicate
the best fit of an ab initio model based on the optical Bloch equation (30).

Table 1. Transition frequencies of p̄⁴He⁺ and p̄³He⁺ atoms. For experimental values, total, statistical,
and systematic uncertainties are given in parentheses. Theoretical frequencies have uncertainties from
numerical errors of the calculation and uncalculated QED terms. κ_M and κ_e are the sensitivities of different

linewidths Γ = 6 to 20 MHz were a factor of 4 to
10 narrower than those used in previous single-photon
experiments (23). In the first system, a
ring dye laser pumped by an Ar-ion laser produced
continuous-wave beams of P = 1 W, λ =
593 to 672 nm, and Γ < 0.1 MHz. Its frequency
was stabilized by using the Pound-Drever-Hall 
technique to a temperature-stabilized, 75-mm- 
long Fabry-Perot cavity that was suspended in 
a vacuum chamber. The seed laser frequencies 
were measured to a precision of <2 x 10°'° by 
using a Ti:sapphire femtosecond optical comb 
that was stabilized to a global positioning satellite- 
disciplined, quartz oscillator. This seed beam 
was amplified to a pulse energy E = 2 to4 mJ in 
two dye cells pumped by a Q-switched, single- 
longitudinal-mode Nd:YAG laser of E = 180 mJ 
and A = 532 nm. In the previous single-photon 
experiment (23), a systematic uncertainty arose 
from the spectral broadening and shift during 
the dye amplification; this was caused by rapid 
modulations in the intensity of the pump laser. 
We reduced this uncertainty by developing a 
pump laser (24) with a long (At = 40 to 60 ns), 
smooth temporal profile with a single longitu- 
dinal mode. 

In the second system, a continuous-wave Ti: 
sapphire laser pumped by a Nd:YVO, laser 
generated seed beams of P = 1 W, A = '723 to 941 nm, 
and I < 0.1 MHz. This beam was injection- 
seeded into a 1-m-circumference Ti:sapphire cav- 
ity (6, 25), which produced 40- to 100-ns-long 
laser pulses of E = 4 to 13 mJ. For measuring the 
blue and ultraviolet (UV) resonances, the beam 
was further amplified to E = 15 to 40 mJ ina Ti: 
sapphire multipass amplifier, before being 
frequency-doubled or -tripled in lithium triborate 
(LBO) and £-barium borate (BBO) crystals. The 
absolute precision of the pulsed dye and Ti: 
sapphire laser systems of <2 x 10-9 and <1.4 x 
10 °, respectively, was verified by measuring some 
transition frequencies in Cs continuously over a 
2-week period. 

The laser pulses excited transitions between 
pHe? states with 1- to 2-us lifetimes and states that 
led to Auger emission of the Is electron with nano- 
second lifetimes. The PHe?* ion that remained 
after Auger decay was rapidly destroyed by Stark 
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transitions to antiproton mass and charge, respectively. ppb, parts per billion. 


Transition Experiment frequency 
(n, 1) > (1, 1’) (MHz) 
p ee 


356155990.1(2.1)(2.0)(0.8) 


412885133.1(1.0)(0.8)(0.6) 


(34, 33) > (35, 32) 


655062100(10)(10)(1) 


Theor. frequency Ku kg 
(MHz) (ppb) (ppb) 


356155990.5(4) 


412885132.4(2) 


655062101.92(7) Dil 34 


(32, 31) > G1, 30) 


1132609226 .7(2.8)(2.5)(1.4) 


1132609224.01(8) 3S 24 


(38, 34) > (37, 33) 505222282(4)(4)(1) 


505222281.0(3) 8 31 


(36, 33) > (35, 32) 646180416(5)(4)(1) 


646180412.58(5) 6 


(32, 31) > (21, 30) 


1043128581(5)(4)(1) 


effects, which mixed the Rydberg ionic states 
with s, p, and d states at high n during colli- 
sions with helium atoms. The charged pions that 
emerged from the resulting antiproton anni- 
hilations in the nucleus were detected with a 
Cherenkov detector (24), thus revealing the reso- 
nance condition between the laser and pHe™ as 
a sharp spike in the annihilation rate (fig. S2). 
The profile of the p*He™ transition (n, J) = 
(37, 35) — (38, 34) is shown in Fig. 2A, obtained 
by plotting the intensities of the annihilation 
signals induced at laser frequencies between 
-1 and 1 GHz around the resonance centroid. 
Each data point was collected from 6 to 10 anti- 
proton pulses. Pairs of fine structure sublines 
arise from the dominant interaction between the 
orbital angular momentum of the antiproton and 
the electron spin; the positions of the four hyper- 
fine sublines (26) that arise from the additional 
spin-spin interaction between the antiproton and 
electron are indicated in Fig. 2 with arrows. This 


1043128580.64(8) 3) 


single-photon resolution exceeds those of previ- 
ous sub-Doppler two-photon spectroscopy exper- 
iments (6) and is consistent with the Doppler 
width = v\/8kTlog2/Mc? = 160 to 170 MHz of 
p'He* thermalized to T = 1.5 to 1.7 K when the 
contributions of power broadening (<30 MHz), 
natural width (20 MHz), laser line width (6 MHz)— 
which includes the Fourier transform limit (25) due 
to the pulse length of the laser —and the hyperfine 
structure (<30 MHz) are subtracted. Collisional 
broadening effects are small (27). Here, k de- 
notes the Boltzmann constant, M is the atom’s 
mass, and c is the speed of light. The strong 
signal intensity (3x larger than in the previous 
high-resolution single-photon experiment rela- 
tive to the total number of p*He* atoms formed 
in the target) arises from the high density of cold 
pHe* that lies within the thermal velocity class 
excited by the laser. The profile of a previously 
undetected p*He* resonance (38, 35) > (39, 34) 
is shown in Fig. 2C. Its spectral resolution is 
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Fig. 3. Single-photon transition frequencies of antiprotonic helium, and the antiproton-to-electron 
mass ratio determined from them. (A) Comparison of experimental and calculated transition fre- 
quencies. Shown are the fractional differences between the experimental (open circles) and theoretical 
(squares) values of 13 transition frequencies of D‘Het and D°He* atoms cooled to T = 1.5 to 17 K. 
(B) Antiproton-to-electron mass ratio compared with proton-to-electron mass ratios. From top to 
bottom are the most precise direct measurement (18) of the proton-to-electron mass ratio obtained 
by contrasting the cyclotron frequencies of a proton and an electron confined in a Penning trap; 


indirect determinations (19-21) by measuring the magnetic moment of the bound electron in 
ions in a Penning trap; laser spectroscopy of HD* molecules (22); antiproton-to-electron mass 


and 1697+ 


1205+ 


ratio obtained from two-photon laser spectroscopy of p*He* (6); and single-photon spectroscopy of cold 
atoms in this work. The yellow band represents the CODATA2010-recommended value (17). The 


CODATA2014 value is similar to the value of (21). 


limited by the >1-GHz Auger width of the 
daughter state. Previous experiments could not 
resolve the fine structure in favored transitions 
of the type (n, 1) > (n - 1, 1 - 1), but it is now 
revealed in the (39, 35) — (38, 34) resonance of 
cold p*He* (Fig. 2B). We also resolved the three- 
peak structure in the p*He* resonance (36, 34) 
— (37, 33) (Fig. 2D) arising from the eight 
unequally spaced hyperfine sublines caused by 
the additional interaction of the *He nuclear 
spin. Altogether, we measured 13 resonances, 
which represent most of the metastable DHe* 
population (28). The fact that they were much 
narrower than the single-photon resonances 
measured previously (23) with inferred Doppler 
widths of 160 to 470 MHz relative to their 
transition frequencies of v = 356 to 1133 THz, and 
no substantial broadened background compo- 
nent, indicates that nearly all of the atoms had 
cooled to T = 1.5 to 1.7 K. Some of the transi- 
tions [ (34, 33) (35, 32) of p*He* (fig. S3)] were 
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broadened because of the short (t = 170 ps) Auger 
lifetime of the resonance daughter state (14). 
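
The Doppler width quoted above can be checked numerically. The following is a minimal sketch, not the authors' analysis code: it evaluates ν√(8kT ln 2/Mc²) for the (37, 35) → (38, 34) transition near 412.885 THz. The atom mass of about 5.01 u (helium-4 plus one antiproton) and the function name are assumptions introduced here for illustration.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
C = 299_792_458.0         # speed of light, m/s
AMU = 1.66053906660e-27   # atomic mass unit, kg

def doppler_fwhm_hz(nu_hz, temperature_k, mass_amu):
    """Full width at half maximum of a Doppler-broadened line, in Hz."""
    mass_kg = mass_amu * AMU
    return nu_hz * math.sqrt(
        8.0 * K_B * temperature_k * math.log(2.0) / (mass_kg * C ** 2)
    )

# Assumed values: transition frequency from Table 1, approximate atom mass.
NU_HZ = 412.885e12
MASS_AMU = 5.01

widths_mhz = [doppler_fwhm_hz(NU_HZ, t, MASS_AMU) / 1e6 for t in (1.5, 1.7)]
for t, w in zip((1.5, 1.7), widths_mhz):
    print(f"T = {t} K: Doppler FWHM is about {w:.0f} MHz")
```

Under these assumptions the widths come out close to the 160 to 170 MHz range given in the text.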

The spin-independent transition frequencies ν_exp (Table 1) were determined by fitting the profiles with a theoretical line shape (Fig. 2, blue lines). The line shape was obtained by solving the optical Bloch equations of the single-photon transitions between the (2ℓ + 1) ≈ 70 magnetic substates and taking into account power-broadening effects, thermal motion of the atoms, the spurious frequency modulation in the laser, and the spatial and temporal profiles of the laser beam. The positions of the hyperfine sublines were fixed to the theoretical values (26), the uncertainty of which affects ν_exp by 0.1 MHz (table S1).

Cold collisions with helium atoms deform the p̄He⁺ orbitals and shift the resonance frequencies (27, 29). As shown in fig. S4, A and C, the frequency shifts of two p̄⁴He⁺ transitions, (38, 35) → (39, 34) and (37, 35) → (38, 34), have the largest observed gradients dν/dρ, measured at target densities between ρ = 8 × 10^[…] and 8 × 10^[…] cm⁻³. The much smaller shifts of other transitions, such as (32, 31) → (31, 30), were not resolvable within the 2-MHz experimental uncertainty. The measured shifts agree with quantum chemistry calculations (27) within ~20%. A linear extrapolation to ρ = 0 yielded the zero-density frequencies; for transitions with small dν/dρ, the gradients measured at higher densities ρ > 1 × 10^[…] cm⁻³ were used (23).

The strong resonance signal from the large density of cold p̄He⁺ occupying the thermal distribution depopulated by the laser yielded a small statistical uncertainty σ_stat = ±1 MHz on ν_exp of the p̄⁴He⁺ transition (37, 35) → (38, 34), and σ_stat = ±10 MHz for (34, 33) → (35, 32), which has a daughter state of large (1 GHz) natural width (fig. S3 and table S1). A systematic uncertainty of 0.4 to 3 MHz arose from the fitting function (30) itself. The magnetic Zeeman shifts in these transitions between Rydberg states are small (<0.1 MHz) under our experimental conditions. The spurious frequency modulation of the laser pulses (24, 25) was measured with a precision of 0.4 to 1.0 MHz. The shift in ν_exp arising from laser fields that induce ac Stark effects (30) was <5 MHz in our earlier two-photon experiment carried out at higher laser fluences (6); the shifts were smaller (<0.2 MHz) for the single-photon transitions studied here.

All measured transition frequencies ν_exp (Fig. 3A, open circles with error bars) agree with theoretical ν_th values (filled squares) within the experimental uncertainties of 2.5 × 10⁻⁹ to 15 × 10⁻⁹. This agreement is 1.4 to 10 times better than in previous single-photon experiments (23). The uncertainties for most of the theoretical frequencies ν_th are due to uncalculated QED contributions of orders higher than mα⁷, whereas for some transitions [(37, 35) → (38, 34) of p̄⁴He⁺] numerical uncertainties in the calculation of the relativistic Bethe logarithm (table S1) dominate. The corrections on ν_th due to the finite charge radii (12, 31) of the helium nucleus (4 to 7 MHz) and of the antiproton (<1 MHz) are small because the Rydberg antiproton orbital has negligible overlap with the nucleus and is polarized away from the 1s electron.

The frequencies ν_th for the unfavored (n, ℓ) → (n + 1, ℓ - 1) transitions with low transition probabilities (30) between the n = 36 states changed by 2.6 × 10⁻⁹ to 2.7 × 10⁻⁹ when the M_p̄/m_e ratio used in the calculations was changed by 1 × 10⁻⁹. These sensitivities κ_M = 2.6 to 2.7 (Table 1) of the transitions (5) measured here are >2× larger than those of the low-n or two-photon transitions studied earlier. The M_p̄/m_e value that yielded the best agreement between ν_th and ν_exp of the p̄⁴He⁺ transition (37, 35) → (38, 34) deviated from the CODATA2010-recommended value for M_p/m_e by a fractional amount of (7 ± 10) × 10⁻¹⁰. By minimizing Σ[ν_th(M_p̄/m_e) - ν_exp]²/σ_stat² over the 13 p̄He⁺ frequencies, and considering the above systematic uncertainties (σ_sys), we obtained the ratio M_p̄/m_e = 1836.1526734(15). The 1-SD uncertainty in the parentheses includes the contributions 9 × 10⁻⁷, 11 × 10⁻⁷, and 3 × 10⁻⁷ of the experimental statistical and systematic uncertainties and the theoretical uncertainty, respectively. This agrees with the determination M_p̄/m_e = 1836.1526736(23) obtained from sub-Doppler two-photon laser spectroscopy (6).
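
As a quick consistency check on the quoted error budget, the three contributions can be combined in quadrature. This is a minimal sketch that assumes the contributions are independent and add as a root sum of squares; the source does not state the combination rule explicitly, and the function name is illustrative.

```python
import math

def combine_in_quadrature(*components):
    """Root-sum-of-squares combination of independent 1-SD uncertainties."""
    return math.sqrt(sum(c * c for c in components))

# Statistical, systematic, and theoretical contributions from the text.
total = combine_in_quadrature(9e-7, 11e-7, 3e-7)
print(f"combined 1-SD uncertainty: {total:.2e}")
```

The result rounds to 15 × 10⁻⁷, which matches the (15) in 1836.1526734(15).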

The atomic mass of the electron was recently determined (21) with a precision of 3 × 10⁻¹¹ by measuring the ratio between the cyclotron frequency and the precession frequency of the electron spin of a ¹²C⁵⁺ ion confined in a Penning trap and comparing the results with QED calculations of its g-factor. From this and the known proton mass, the proton-to-electron mass ratio was determined as M_p/m_e = 1836.15267377(17), which is larger by 4 × 10⁻⁹ and 8 × 10⁻¹⁰ than the ratios determined by comparing the cyclotron frequencies of protons and electrons in a Penning trap and the CODATA2010-recommended value, respectively. Our M_p̄/m_e value agrees with the result from (21) (Fig. 3B). The precision of our measurements here is a factor of 3.5 better than the recent determination of M_p/m_e by means of laser spectroscopy of HD⁺ molecules (22).

We used the analysis method of Hughes and Deutch (32-36) to constrain the equality between antiproton and proton masses δ_M = (M_p - M_p̄)/M_p and charges δ_Q = (Q_p + Q_p̄)/Q_p from these measured p̄He⁺ transition frequencies. The p̄He⁺ transition frequencies are proportional to Q_p̄²M_p̄ and have linear dependencies δ_M κ_M + δ_Q κ_Q < |ν_exp - ν_th|/ν_th; the calculated sensitivities κ_M and κ_Q are shown in Table 1. The right side of the equation was evaluated as <(3 ± 15) × 10⁻¹⁰ by averaging over the transitions and considering the statistical and systematic uncertainties. The Antihydrogen Trap (ATRAP) and Baryon Antibaryon Symmetry Experiment (BASE) collaborations (34-36) have, on the other hand, measured the cyclotron frequency (∝ Q_p̄/M_p̄) of antiprotons confined in Penning traps as (Q_p̄/M_p̄)/(Q_p/M_p) + 1 = 1.6(9) × 10⁻¹⁰ (34, 35) and 1(69) × 10⁻¹² (36), which implies that δ_Q ≈ δ_M. By combining the two types of experimental results, we conclude that any deviation between the proton and antiproton masses and charges is <5 × 10⁻¹⁰ at the 90% confidence level. Recently, the ALPHA experiment has provided a limit of <7 × 10⁻¹⁰ on any residual charge of antihydrogen (37) by analyzing its movement in a magnetic bottle trap; when combined with the p̄He⁺ results, this constrains the equality of the electron and positron charges to a similar level.
OPTICAL PROCESSING


A fully programmable 100-spin 
coherent Ising machine with 
all-to-all connections 
Unconventional, special-purpose machines may aid in accelerating the solution of some
of the hardest problems in computing, such as large-scale combinatorial optimizations, by 
exploiting different operating mechanisms than those of standard digital computers. We 
present a scalable optical processor with electronic feedback that can be realized at large 
scale with room-temperature technology. Our prototype machine is able to find exact 
solutions of, or sample good approximate solutions to, a variety of hard instances of Ising 
problems with up to 100 spins and 10,000 spin-spin connections. 


Combinatorial optimization problems, including many nondeterministic polynomial-time-hard (NP-hard) problems, are central in numerous important application areas, including operations and scheduling, drug discovery, finance, circuit design, sensing, and manufacturing. Despite large advances in both algorithms and digital computer technology, even typical instances of NP-hard problems that arise in practice may be very difficult to solve on conventional computers. There is a long history of attempts to find alternatives to current von Neumann-computer-based methods for solving such problems, including use of neural networks realized with analog electronic circuits (1, 2) and by using molecular computing (3-5). Both lines of investigation continue to inspire related work (6, 7). A major topic of contemporary interest is the study of adiabatic quantum computation (AQC) (8) and quantum annealing (QA) (9, 10). Sophisticated AQC/QA devices are already under study (11-14), but providing dense connectivity between qubits remains a major challenge (15), with important implications for the efficiency of AQC/QA systems (16).

Networks of coupled optical parametric oscillators (OPOs) are an alternative physical system, with an unconventional operating mechanism (17-20), for solving the Ising problem (21, 22) and by extension many other combinatorial optimization problems (23). Formally, the N-spin Ising problem is to find the configuration of spins σ_i ∈ {-1, +1} (i = 1, ..., N) that minimizes the energy function

$$H = -\sum_{1 \le i < j \le N} J_{ij}\,\sigma_i \sigma_j - \sum_{1 \le i \le N} h_i \sigma_i,$$

where the particular problem instance being solved is specified by the N × N matrix J (with elements J_ij) and the length-N vector h (with elements h_i).

Fig. 1. Experimental schematic of a measurement-feedback-based coherent Ising machine. A time-division-multiplexed pulsed degenerate optical parametric oscillator is formed by a nonlinear crystal [periodically poled lithium niobate (PPLN)] in a fiber ring cavity containing 160 pulses. A fraction of each pulse is measured and used to compute a feedback signal that effectively couples the otherwise-independent pulses in the cavity. IM, intensity modulator; PM, phase modulator; LO, local oscillator; SHG, second-harmonic generation; FPGA, field-programmable gate array. [Figure labels: pump; pulsed laser, 1560 nm; PPLN waveguide; OPOs; fiber beamsplitter.]
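
The energy function above translates directly into code. The sketch below is illustrative only: the function name and the three-spin example instance are assumptions made here, not part of the reported experiment.

```python
from itertools import product

def ising_energy(J, h, spins):
    """H = -sum_{i<j} J[i][j]*s_i*s_j - sum_i h[i]*s_i for spins in {-1, +1}."""
    n = len(spins)
    pair_term = sum(
        J[i][j] * spins[i] * spins[j]
        for i in range(n)
        for j in range(i + 1, n)
    )
    field_term = sum(h[i] * spins[i] for i in range(n))
    return -pair_term - field_term

# Hypothetical 3-spin instance: a triangle with J_ij = -1 on every edge, h = 0.
J = [[0, -1, -1],
     [-1, 0, -1],
     [-1, -1, 0]]
h = [0, 0, 0]

example_energy = ising_energy(J, h, [1, -1, 1])

# Exhaustive search is feasible only for tiny N (2**N configurations).
ground_energy = min(ising_energy(J, h, s) for s in product((-1, 1), repeat=3))
print(example_energy, ground_energy)
```

Exhaustive enumeration scales as 2^N, which is why it serves here only as a reference for very small instances.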

We have realized a system with a scalable 
architecture that uses measurement feedback in 
place of a network of optical delay lines [which 
were used in initial, low-connectivity, nonre- 
programmable demonstrations of the concept 
(18, 24, 25)]. Our 100-spin Ising machine allows 
connections between any spin and any other 
spin and is fully programmable. We show that 
measurement-feedback-based OPO Ising ma- 
chines can solve many different Ising problems, 
and in cases in which exact solutions are not 
easy to obtain, we can find good approximate 
solutions. 

The schematic of our experimental setup 
(Fig. 1) shows that our Ising machine is 
formed by the combination of time-division- 
multiplexed OPOs (18) in a single fiber-ring 
cavity, with measurement and feedback (injec- 
tion) stages that act to couple the pulses in the 
cavity such that the Ising Hamiltonian is real- 
ized. Details are provided in the supplementary 
materials (26). 
Fig. 2. Results with N = 16 cubic graphs. (A) A Möbius ladder graph with N = 16 vertices. (B) The evolution of the in-phase components c_i of the N = 16 OPO pulses as a function of the computation time. (C) The graph cut size achieved as a function of the computation time. (D to F) Histograms of obtained solutions in 100 runs for the graphs shown in the insets. (G) Histogram of the observed probabilities of obtaining a ground state in a single run, for all 4060 unweighted, undirected cubic graphs with N = 16 vertices.
Fig. 3. Results with various-size Möbius ladder graphs. (A) Observed probability of obtaining a ground state of the Möbius ladder graph in a single run, as a function of the size N of the graph. Multiple 100-run batches were performed for each graph size to obtain the standard deviations, which are shown as error bars. (B to D) Histograms of obtained solutions in 100 runs for the graphs shown in the insets.


Fig. 4. Results with various-size and various-density random graphs. (A) Observed probability of obtaining a solution whose cut size is at least x% of the global optimum (maximum cut), as a function of graph size N, for random cubic graph instances. Error bars indicate 1 SD, which is dominated by the difference in difficulty between the various problem instances. (B) The runtime that would be required to obtain a solution of a particular accuracy with 99% probability. (C) The evolution of the in-phase components c_i of the N = 100 OPO pulses as a function of the computation time, for a single run with the graph shown in the (D) inset. (D) The graph cut size achieved as a function of the computation time. (Inset) The graph being solved. (E) Observed success probability of obtaining a solution with a particular accuracy as a function of the density of edges in the graph. Experiments were performed on randomly generated N = 100-vertex graphs with fixed numbers of edges. Error bars indicate 1 SD.
We have focused our experiments on Ising problems on undirected, unweighted graphs (V, vertices; E, edges), where J_ij = -1 when spin i and spin j are connected, and J_ij = 0 otherwise, and for which the linear (Zeeman) terms h_i are zero. An Ising problem of this form is equivalent to the Max-Cut problem on the underlying graph (17). Max-Cut is the problem of partitioning the vertices of a graph into two disjoint subsets such that the number of edges between the two subsets is maximized; the partition is called a cut. Max-Cut remains NP-hard even when the input is restricted to unweighted cubic graphs (27). We refer interchangeably to the Max-Cut problem and to the Ising problem, and the solutions thereof. The energy of a particular spin configuration {σ_i}, i = 1, ..., N, for an Ising problem is given by

$$H(\{\sigma_i\}) = -\sum_{1 \le i < j \le N} J_{ij}\,\sigma_i \sigma_j - \sum_{1 \le i \le N} h_i \sigma_i.$$

The same spin configuration {σ_i} represents a cut of size

$$C(\{\sigma_i\}) = -\frac{1}{2}\sum_{1 \le i < j \le N} J_{ij} - \frac{1}{2} H(\{\sigma_i\});$$

there is a direct mapping between the energies in the Ising problem and the cut sizes in the Max-Cut problem, and minimizing the Ising energy maximizes the cut.
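
The mapping between Ising energy and cut size can be verified on a small example. The following sketch builds a Möbius ladder with N = 16 vertices (a ring plus rungs joining opposite vertices), counts the cut edges of one hand-picked partition directly, and compares the result with the cut size obtained from the energy. The helper names and the chosen partition are assumptions for illustration; the partition is not claimed to be optimal.

```python
def mobius_ladder_edges(n):
    """Edges of a Mobius ladder: an n-cycle plus rungs between opposite vertices."""
    half = n // 2
    ring = [(i, (i + 1) % n) for i in range(n)]
    rungs = [(i, i + half) for i in range(half)]
    return ring + rungs

def cut_size_direct(edges, spins):
    """Number of edges whose endpoints carry opposite spins."""
    return sum(1 for i, j in edges if spins[i] != spins[j])

def cut_size_from_energy(edges, spins):
    """C = -(1/2) sum J_ij - (1/2) H, with J_ij = -1 on edges and h = 0."""
    sum_j = -len(edges)                                       # sum over i<j of J_ij
    energy = -sum(-1 * spins[i] * spins[j] for i, j in edges) # H for this instance
    return (-sum_j - energy) // 2

N = 16
edges = mobius_ladder_edges(N)

# Hand-picked partition: alternate signs on the first half, flip across each rung.
first_half = [1 if i % 2 == 0 else -1 for i in range(N // 2)]
spins = first_half + [-s for s in first_half]

cut_direct = cut_size_direct(edges, spins)
cut_via_energy = cut_size_from_energy(edges, spins)
print(len(edges), cut_direct, cut_via_energy)
```

Both counts agree for any spin assignment, because each cut edge contributes -1 to the energy and each uncut edge contributes +1.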

The unweighted, undirected Möbius ladder (cubic) graph with N = 16 vertices is shown in Fig. 2A. We programmed the corresponding J matrix into our feedback electronics and ran the problem on our experimental apparatus. Figure 2B shows, from a single run of this single problem instance, the evolution of the in-phase component of each OPO pulse (c_i) as a function of the number of times each pulse circulated around the cavity (the number of round trips). The computation time is given by T_comp = N_rt T_rt, where T_rt = 1.6 μs is the single round trip time and N_rt is the number of round trips. Each spin σ_i in the Ising problem is represented by a single OPO pulse; if the in-phase component c_i of the ith OPO pulse is less than zero (c_i < 0), we make the spin assignment σ_i = -1, and if the in-phase component is greater than zero (c_i > 0), we make the spin assignment σ_i = +1. As the feedback signal level is gradually increased, the OPO amplitudes increase. Most of the OPOs obtain their ultimate signs by round trip 60. By round trip 100, all the OPOs have reached a steady state. Shown in Fig. 2C is the graph cut size, or equivalently the Ising energy, represented by the spin configuration after each round trip for the run shown in Fig. 2B. We see that the system evolves toward solutions representing larger graph cuts, or equivalently lower Ising energies, and the steady-state configuration is that of a ground state. The system finds the ground state within 120 μs. If we rerun the computation many times, we find that our system always returns the optimal solution for this particular problem instance (Fig. 2D). The distributions of obtained solutions for two other N = 16 cubic graphs are shown in Fig. 2, E and F. In both cases, the system finds the ground state in a large fraction of the runs, and when the system does not return a ground state, it returns a state that has energy close to that of the ground state, a good approximate solution. To illustrate that these instances are not special cases for which the Ising machine finds exact solutions (ground states), we attempted to solve all possible problem instances of N = 16 cubic graphs, of which there are 4060. We were able to find ground states with probability greater than 20% for every single instance (Fig. 2G). In this experiment, and all that follow, every run was set to proceed for N_rt = 300 round trips, a computation time of T_comp = 480 μs.
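
The readout rule and the computation time described above are simple enough to sketch. The names below are illustrative, and the handling of c_i = 0 (mapped to -1 here) is an assumption, since the text defines the assignment only for strictly negative or strictly positive in-phase components.

```python
ROUND_TRIP_TIME_US = 1.6  # single round trip time T_rt from the text, in microseconds

def computation_time_us(n_round_trips):
    """T_comp = N_rt * T_rt, in microseconds."""
    return n_round_trips * ROUND_TRIP_TIME_US

def spins_from_in_phase(components):
    """Map in-phase components c_i to spins: +1 if c_i > 0, else -1.

    The c_i == 0 case is not specified in the text; -1 is an arbitrary choice.
    """
    return [1 if c > 0 else -1 for c in components]

t_comp = computation_time_us(300)
readout = spins_from_in_phase([0.3, -1.2, 2.0, -0.05])
print(t_comp, readout)
```

With N_rt = 300 this gives the 480 μs quoted for every run.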

We next explored how the Ising machine per- 
forms as a function of the size of the problem. 
In this first scaling study, we used the Möbius ladder graphs with varying size. This is a convenient choice because there is a closed-form solution for the maximum cut and Ising energy for every instance in this family of graphs. The Möbius ladder graphs with N = 8, 12, ..., 100
were solved (Fig. 3A). For each instance, we per- 
formed multiple sets of 100 runs. Each 100-run 
set resulted in a solution-energy histogram (an 
example for N = 8 is shown in Fig. 3B). For the 
runs in which the ground state is not found, the 
system again finds good approximate solutions 
(Fig. 3, C and D). 

To understand how the performance of our 
system scales as a function of problem size for 
more general problems, 10 random instances of 
cubic graphs were generated for each graph size 
N = 10, 20, ..., 100, and each instance was run on our
apparatus. The success probabilities for each in- 
stance are shown in Fig. 4A, aggregated by graph 
size. In contrast to the previous results, not only 
the probability of obtaining a ground state but 
also the probability of obtaining a solution that 
is within 7% of the optimal (maximum cut) so- 
lution is shown. The error bars indicate a spread 
in success probabilities primarily due to the fact 
that different random instances of each size 
may be more or less difficult to solve. 

Figure 4B shows, for each graph size, the total computation time required to obtain a solution with the given accuracy, with probability 99% within that time, and is derived directly from Fig. 4A. The total time is given by T_comp × ⌈log(1 - 0.99)/log(1 - p_s)⌉, where p_s is the corresponding success probability from Fig. 4A. The total computation time required to obtain ground states (100% accuracy) grows rapidly with problem size N, although it is still small in absolute terms for N = 100: less than 200 ms. The growth in total computation time is far less severe when the required solution accuracy is reduced; for 95% accuracy, the required time increases from ~1 ms when N = 10 to ~4 ms when N = 100; for 90% accuracy, the required time remains roughly constant at ~1 ms.
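
The time-to-solution estimate can be illustrated as follows. This is a sketch under two assumptions: that the brackets in the formula denote rounding up to a whole number of runs, and that at least one run is always needed. The function name and the example probabilities are hypothetical and are not values read from Fig. 4A.

```python
import math

def time_to_solution_us(p_success, t_comp_us=480.0, target=0.99):
    """Total time to succeed at least once with probability `target`.

    Repeats independent runs of length t_comp_us, each succeeding with
    probability p_success, and rounds the required run count up.
    """
    if not 0.0 < p_success <= 1.0:
        raise ValueError("p_success must be in (0, 1]")
    if p_success == 1.0:
        return t_comp_us  # a single run always succeeds
    runs = math.ceil(math.log(1.0 - target) / math.log(1.0 - p_success))
    return t_comp_us * max(runs, 1)

# Hypothetical success probabilities, for illustration only.
for p in (0.2, 0.5, 1.0):
    print(p, time_to_solution_us(p) / 1000.0, "ms")
```

For example, a per-run success probability of 20% requires 21 runs, or about 10 ms at 480 μs per run.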

The OPO amplitudes and cut sizes/energies are shown in Fig. 4, C and D, respectively, as a function of the number of round trips for a single run on a single nonregular random graph instance with N = 100 vertices and |E| = 495 edges. We see that the global optimum is reached within 100 μs, and the system reaches a steady state after ~120 μs. This graph instance is one of 10 instances that were generated having 495 edges. We generated 10 random graphs with a fixed number of edges, for each of |E| = 50, 495, 990, 1485, 1980, 2475, 2970, 3465, 3960, 4455, 4900, and ran these instances on our apparatus. The results of this study of the performance on random graphs as a function of edge density d = 2|E|/[|V|(|V| - 1)] are shown in Fig. 4E; the system was able to find good solutions for all the attempted densities.
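
The edge density and a fixed-edge-count random graph can be sketched as below. The sampling procedure (uniform choice of |E| distinct vertex pairs) is an assumption made here; the text does not describe how the random instances were generated, and the function names are illustrative.

```python
import random
from itertools import combinations

def edge_density(num_edges, num_vertices):
    """d = 2|E| / (|V| (|V| - 1)) for a simple undirected graph."""
    return 2 * num_edges / (num_vertices * (num_vertices - 1))

def random_graph_edges(num_vertices, num_edges, seed=0):
    """Sample num_edges distinct vertex pairs uniformly at random (assumed scheme)."""
    all_pairs = list(combinations(range(num_vertices), 2))
    rng = random.Random(seed)
    return rng.sample(all_pairs, num_edges)

graph = random_graph_edges(100, 495)
density = edge_density(len(graph), 100)
print(len(graph), density)
```

For |V| = 100 and |E| = 495 the density is 0.1, since a complete graph on 100 vertices has 4950 edges.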

OPO Ising machines have the potential to har- 
ness a number of quantum features, including 
squeezing and superposition (20). Pulsed OPOs 
already have a substantial quantum-mechanical 
nature (28), and networks of OPOs can generate 
spatial multimode entanglement (19). The exper- 
imental system reported in this work is well- 
described by a quantum-mechanical model (20) 
[with very good agreement between our experimen- 
tal results and numerical simulation results (26)]. 
However, the extent to which classical models 
can capture the behavior of OPO Ising machines 
is not yet known, and follow-up studies are needed 
to determine the fundamental computational pow- 
er of OPO Ising machines. Two future modifica- 
tions to the experiment that could increase the 
relevance of quantum mechanics are reducing 
the cavity round-trip loss and injecting squeezed 
vacuum states into the open port of the mea- 
surement beamsplitter (29), which is predicted 
to improve the success probability (26). 


Although we find the overall direction of OPO 
Ising machines promising, the techniques we 
have demonstrated are not necessarily restricted 
to OPO Ising machines, and photonic-AQC and 
boson-sampling (30) experiments, among others, 
may be able to adapt our methods. 
Mutational signatures associated with tobacco smoking in human cancer

Tobacco smoking increases the risk of at least 17 classes of human cancer. We analyzed somatic mutations and DNA methylation in 5243 cancers of types for which tobacco smoking confers an elevated risk. Smoking is associated with increased mutation burdens of multiple distinct mutational signatures, which contribute to different extents in different cancers. One of these signatures, mainly found in cancers derived from tissues directly exposed to tobacco smoke, is attributable to misreplication of DNA damage caused by tobacco carcinogens. Others likely reflect indirect activation of DNA editing by APOBEC cytidine deaminases and of an endogenous clocklike mutational process. Smoking is associated with limited differences in methylation. The results are consistent with the proposition that smoking increases cancer risk by increasing the somatic mutation load, although direct evidence for this mechanism is lacking in some smoking-related cancer types.


Tobacco smoking has been associated with at least 17 types of human cancer (Table 1) and claims the lives of more than 6 million people every year (1-4). Tobacco smoke is a complex mixture of chemicals, among which at least 60 are carcinogens (5). Many of these are thought to cause cancer by inducing DNA damage that, if misreplicated, leads to an increased burden of somatic mutations and, hence, an elevated chance of acquiring driver mutations in cancer genes. Such damage often occurs in the form of covalent bonding of metabolically activated reactive species of the carcinogen to DNA bases, termed DNA adducts (6). Tissues directly exposed to tobacco smoke (e.g., lung), as well as some tissues not directly exposed (e.g., bladder), show elevated levels of DNA adducts in smokers and, thus, evidence of exposure to carcinogenic components of tobacco smoke (7, 8).

Each biological process causing mutations in somatic cells leaves a mutational signature (9). Many cancers have a somatic mutation in the TP53 gene, and catalogs of TP53 mutations compiled two decades ago enabled early exploration of these signatures (10), showing that lung cancers from smokers have more C>A transversions than lung cancers from nonsmokers (11-14). To investigate mutational signatures using the thousands of mutation catalogs generated by systematic cancer genome sequencing, we recently described a framework in which each base substitution signature is characterized using a 96-mutation classification that includes the six substitution types together with the bases immediately 5′ and 3′ to the mutated base (15). The analysis extracts mutational signatures from mutation catalogs and estimates the number of mutations contributed by each signature to each cancer genome (15). Using this approach, more than 30 different base substitution signatures have been identified (16-18).
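
The 96-mutation classification follows from simple counting: six substitution types, each with four possible 5′ bases and four possible 3′ bases. The sketch below enumerates the classes. Writing each substitution relative to the pyrimidine of the mutated base pair (C>A, C>G, C>T, T>A, T>C, T>G) and the bracketed label format are conventions assumed here for illustration; the text states only that there are six substitution types.

```python
from itertools import product

BASES = "ACGT"
# Assumed convention: substitutions referred to the pyrimidine of the base pair.
SUBSTITUTIONS = ["C>A", "C>G", "C>T", "T>A", "T>C", "T>G"]

def mutation_classes():
    """Enumerate trinucleotide-context classes as 5'-base[ref>alt]3'-base labels."""
    return [
        f"{five_prime}[{sub}]{three_prime}"
        for sub in SUBSTITUTIONS
        for five_prime, three_prime in product(BASES, repeat=2)
    ]

classes = mutation_classes()
print(len(classes), classes[:4])
```

A mutation catalog for one genome is then a vector of 96 counts, one per class, which is the input from which signatures are extracted.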

In this study, we examined 5243 cancer ge- 
nome sequences (4633 exomes and 610 whole 
genomes) of cancer classes for which smoking 
increases risk, with the goal of identifying mu- 
tational signatures and methylation changes as- 
sociated with tobacco smoking (table S1). Of the 
samples we studied, 2490 were reported to be 
from tobacco smokers and 1063 from never- 
smokers (Table 1). Thus, we were able to inves- 
tigate the mutational consequences of smoking 
by comparing somatic mutations and methyla- 
tion in smokers versus nonsmokers for lung, 
larynx, pharynx, oral cavity, esophageal, bladder, 
liver, cervical, kidney, and pancreatic cancers 
(Fig. 1 and table S2). 

We first compared total numbers of base substitutions,
small insertions and deletions (indels),
and genomic rearrangements. The total number
of base substitutions was higher in smokers 
compared with nonsmokers for all cancer types 
together (q-value < 0.05) and, for individual can- 
cer types, in lung adenocarcinoma, larynx, liver, 
and kidney cancers (table S2). Total numbers of 
indels were higher in smokers compared with 
nonsmokers in lung adenocarcinoma and liver 
cancer (table S2). The whole-genome-sequenced 
cases allowed comparison of genome rearrange- 
ments between smokers and nonsmokers in pan- 
creatic and liver cancer, where no differences 
were found (table S2). However, subchromo- 
somal copy-number changes entail genomic 
rearrangement and can serve as surrogates for 
rearrangements. Lung adenocarcinomas from 
smokers exhibited more copy-number aberrations
than those from nonsmokers (table S2).

We then extracted mutational signatures, es- 
timated the contributions of each signature to 
each cancer, and compared the numbers of mu- 
tations attributable to each signature in smokers 
and nonsmokers. Increases in smokers compared 
with nonsmokers were seen for signatures 2, 4, 5, 
13, and 16 [the mutational signature nomencla- 
ture is that used in the Catalogue of Somatic 
Mutations in Cancer (COSMIC) and in (16-18)].
There was sufficient statistical power to show that 
these increases were of clonal mutations (muta- 
tions present in all cells of each cancer) for 
signatures 4 and 5 (q < 0.05), as expected if 
these mutations are due to cigarette smoke ex- 
posure before neoplastic change (supplemen- 
tary text). 

Signature 4 is characterized mainly by C>A 
mutations with smaller contributions from other 
base substitution classes (Fig. 2B and fig. S1). This 
signature was found only in cancer types in 
which tobacco smoking increases risk and mainly 
in those derived from epithelia directly exposed 
to tobacco smoke (figs. S2 and S3). Signature 4 is
very similar to the mutational signature induced in 
vitro by exposing cells to benzo[a]pyrene (cosine 
similarity = 0.94) (Fig. 2B and fig. S3), a tobacco 
smoke carcinogen (19). The similarity extends to 
the presence of a transcriptional strand bias in- 
dicative of transcription-coupled nucleotide ex- 
cision repair (NER) of bulky DNA adducts on 
guanine (fig. S1), the proposed mechanism of 
DNA damage by benzo[a]pyrene. Thus, signa- 
ture 4 is likely the direct mutational consequence 
of misreplication of DNA damage induced by 
tobacco carcinogens. 
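The cosine similarity quoted above compares two signatures as vectors of per-class mutation proportions. The sketch below shows the calculation under stated assumptions: the two four-element profiles are made-up toy values (real signatures have 96 elements), and the function name is illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero profiles."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same number of classes")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("profiles must not be all zeros")
    return dot / (norm_a * norm_b)


# Hypothetical toy profiles; not data from the study.
profile_x = [0.5, 0.3, 0.1, 0.1]
profile_y = [0.4, 0.4, 0.1, 0.1]
print(round(cosine_similarity(profile_x, profile_y), 3))
```

Because mutation proportions are non-negative, the value lies between 0 (no shared classes) and 1 (identical shape), so a value of 0.94 indicates closely matching profiles.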

Most lung and larynx cancers from smokers 
had many signature 4 mutations. Signature 4 
mutations occurred more often in cancers from
smokers compared with nonsmokers in all cancer
types together (table S2) and in lung squamous,
lung adenocarcinoma, and larynx cancers (table
S2). This finding largely accounts for differences
in total numbers of base substitutions (Table 1). 
In nonsmokers, 13.8% of lung cancers showed 
many signature 4 mutations (Fig. 2A; >1 mu- 
tation per megabase), which may be due to pas- 
sive smoking, misreporting of smoking habits, 
or annotation errors. Signature 4 mutations 
were also detected in cancers of the oral cavity, 
pharynx, and esophagus, albeit in much smaller 
numbers than in lung and larynx cancers, per- 
haps because of reduced exposure to tobacco 
smoke or more efficient clearance. Differences 
in mutation burden attributed to signature 4 be- 
tween smokers and nonsmokers were not observed 
in these cancer types (Fig. 1). Signature 4 mu- 
tations were found at low levels in cancers of 
the liver, an organ not directly exposed to to- 
bacco smoke, and were elevated in smokers 
versus nonsmokers (Fig. 1). 

Signature 4 was not extracted from bladder, 
cervical, kidney, or pancreatic cancers, despite 
the known risks conferred by smoking and 
the presence of many smokers in these series. 
Additionally, this mutational signature was not 
extracted from cancers of the stomach, colorectum,
and ovary, nor from acute myeloid leukemia (in
the analyzed series, the smoking status of pa- 
tients with these cancers was unknown, but it is 
likely that many have been smokers). The tis- 
sues from which all of these cancer types are 
derived are not directly exposed to tobacco smoke. 
Simulations indicate that the lack of signature 
4 is not due to statistical limitations (supple- 
mentary text and fig. S4). The absence of sig- 
nature 4 suggests that misreplication of direct 
DNA damage due to tobacco smoke constituents 
does not contribute substantially to mutation 
burden in these cancers, even though DNA ad- 
ducts indicative of tobacco-induced DNA dam- 
age are present in the tissues from which they 
arise (7). 

Signatures 2 and 13 are characterized by C>T 
and C>G mutations, respectively, at TpC dinu- 
cleotides and have been attributed to overactive 
DNA editing by APOBEC deaminases (20, 21).
The cause of the overactivity in most cancers 
has not been established, although APOBECs are 
implicated in the cellular response to the entrance 
of foreign DNA, retrotransposon movement, and 
local inflammation (22). Signatures 2 and 13 
showed more mutations in smokers versus non- 
smokers with lung adenocarcinoma (table S2). 
Because these signatures are found in many
other cancer types, where they are apparently
unrelated to tobacco smoking, it seems unlikely 
that the signature 2 and 13 mutations associated 
with smoking in lung adenocarcinoma are direct 
consequences of misreplication of DNA damage 
induced by tobacco smoke. More plausibly, the 
cellular machinery underlying signatures 2 and 
13 is activated by tobacco smoke, perhaps as a 
result of inflammation arising from the deposi- 
tion of particulate matter or by indirect con- 
sequences of DNA damage. 

Signature 5 is characterized by mutations dis- 
tributed across all 96 subtypes of base substi- 
tution, with a predominance of T>C and C>T 
mutations (Fig. 2B) and evidence of transcriptional
strand bias for T>C mutations (18). Signature 5
is found in all cancer types, including those
unrelated to tobacco smoking, and in most can- 
cer samples. It is “clocklike” in that the number 
of mutations attributable to this signature cor- 
relates with age at the time of diagnosis in many 
cancer types (17). Signature 5, together with sig- 
nature 1, is thought to contribute to mutation ac- 
cumulation in most normal somatic cells and in 
the germline (17, 23). The mechanisms underlying
signature 5 are not well understood, although 
an enrichment of signature 5 mutations was 
found in bladder cancers harboring inactivating
mutations in ERCC2, which encodes a component
of NER (24).

Table 1. Mutational signatures and cancer types associated with tobacco
smoking. Information about the age-adjusted odds ratios for current male
smokers to develop cancer is taken from (2-4). Odds ratios for small cell lung
cancer, squamous cell lung cancer, and lung adenocarcinoma are for an average
daily dose of more than 30 cigarettes. Odds ratios for cervical and ovarian cancers
are for current female smokers. Detailed information about all mutation types, all
mutational signatures, and DNA methylation is provided in table S2. Nomenclature
for signature identification numbers is consistent with the COSMIC database
(http://cancer.sanger.ac.uk/cosmic/signatures). The numbers of smokers and
nonsmokers are unknown (i.e., not reported in the original studies) for acute
myeloid leukemia, stomach, ovarian, and colorectal cancers. The patterns of all
mutational signatures with elevated mutation burden in smokers are displayed
in Fig. 2B. N/A denotes lack of smoking annotation for a given cancer type.
Asterisks indicate that a signature correlates with pack years smoked in a
cancer type. N.S. reflects cancer types without statistically significant elevation
of mutational signatures. The odds ratio for all cancer types is not provided.

Columns: Cancer type; Odds ratio; Nonsmokers; Smokers; Total number of mutational signatures found in the cancer type; Signature 4 found in the cancer type; Mutational signatures with elevated mutation burden in smokers versus nonsmokers (q < 0.05).
All cancer types: odds ratio, ND; nonsmokers, 1062; smokers, 2490; mutational signatures found, 26.
Remaining rows (values not legible): Small cell lung cancer; Lung adenocarcinoma; Oral cavity; Esophagus adenocarcinoma; Colorectal.

SCIENCE sciencemag.org 4 NOVEMBER 2016 • VOL 354 ISSUE 6312 619

Signature 5 (or a similar signature that is dif- 
ficult to differentiate from signature 5 because 
of the relatively flat profiles of these signatures) 
was increased by a factor of 1.3 to 5.1 (q < 0.05; 
table S2) in smokers versus nonsmokers in all 
cancer types together and in lung squamous, 
lung adenocarcinoma, larynx, pharynx, oral cav- 
ity, esophageal squamous, bladder, liver, and 
kidney cancers. The association of smoking with 
signature 5 mutations across these nine cancer 
types therefore includes some for which the 
risks conferred by smoking are modest and for 
which normal progenitor cells are not directly 
exposed to cigarette smoke (Table 1). Given the 
clocklike nature of signature 5 (17), its presence 
in the human germline (23), its ubiquity in 


cancer types unrelated to tobacco smoking (18), 


and its widespread occurrence in nonsmokers, 
it seems unlikely that signature 5 mutations 
associated with tobacco smoking are direct 
consequences of misreplication of DNA dam- 
aged by tobacco carcinogens. It is more plausible 
that smoking affects the machinery generating 
signature 5 mutations (24). Presumably as a con- 
sequence of the effects of smoking, signature 5 
mutations correlated with age at the time of 
diagnosis in nonsmokers (P = 0.001) but not in 
smokers (P = 0.59). 

Signature 16 is predominantly characterized 
by T>C mutations at ApT dinucleotides (Fig. 2B); 
exhibits a strong transcriptional strand bias 
consistent with almost all damage occurring 
on adenine (fig. S5); and, thus far, has been 
detected only in liver cancer. The underlying 
mutational process is currently unknown. Signature
16 exhibited a higher mutation burden
in smokers versus nonsmokers with liver cancer
(table S2). 

For smokers with lung, larynx, pharynx, oral 
cavity, esophageal, bladder, liver, cervical, kidney, 
and pancreatic cancers, quantitative data on cu- 
mulative exposure to tobacco smoke were avail- 
able (table S1). Total numbers of base substitution 
mutations were positively correlated with pack 
years smoked (1 pack year is defined as smoking 
one pack per day for 1 year) for all cancer types 
together (q < 0.05) and for lung adenocarcinoma
(table S3). For individual mutational signatures,
correlations with pack years smoked were found 
in multiple cancer types for signatures 4 and 5 
(table S3). Signature 4 correlated with pack years 
in lung squamous, lung adenocarcinoma, larynx, 
and liver cancers. Signature 5 correlated with 
pack years in all cancers together, as well as in 
lung adenocarcinoma, pharynx, oral cavity, and

Fig. 1. Comparison between tobacco smokers and lifelong nonsmokers.
Bars are used to display average values for numbers of somatic substitutions 
per megabase (MB), numbers of indels per megabase, numbers of dinu- 
cleotide mutations per megabase (Dinucs), numbers of breakpoints per mega- 
base (Breaks), fraction of the genome that shows copy-number changes 
(Aberr.), and numbers of mutations per megabase attributed to mutational 
signatures found in multiple cancer types associated with tobacco smoking. 
Light gray bars represent nonsmokers, whereas dark gray bars are for smokers. 
Comparisons between smokers and nonsmokers for all features, including mutational
signatures specific for a cancer type and overall DNA methylation, are
provided in table S2. Asterisks denote q-value significance levels (* q-value < 0.05).
Error bars correspond to 95% confidence intervals for
each feature. Each q value is based on a two-sample Kolmogorov-Smirnov test 
corrected for multiple hypothesis testing for all features in a cancer type. 
Cancer types are ordered based on their age-adjusted odds ratios for smoking, 
as provided in Table 1. Data for numbers of breakpoints per megabase and 
fraction of the genome that shows copy-number changes were not available 
for liver cancer and small cell lung cancer. Adeno, adenocarcinoma; Esophag., 
esophagus. Note that the presented data include only a few cases (<10) of 
nonsmokers for small cell lung cancer, squamous cell lung cancer, and cancer

bladder cancers (table S3). In lung adenocarcinoma,
correlations with pack years smoked were
also observed for signatures 2 and 13. The rates 
of these correlations allow estimation of the ap- 
proximate numbers of mutations accumulated in 
a normal cell of each tissue due to smoking a pack
of cigarettes a day for a year: lung, 150 mutations;
larynx, 97; pharynx, 39; oral cavity, 23; bladder, 18;
liver, 6 (table S3). 
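The pack-year definition and the per-tissue estimates above lend themselves to a small worked calculation. This is a hedged sketch only: the per-pack-year values are those quoted in the text, but scaling them linearly to other exposures is an assumption made here for illustration, and the function names are hypothetical.

```python
# Approximate mutations accumulated per normal cell per pack year (values quoted in the text).
MUTATIONS_PER_PACK_YEAR = {
    "lung": 150,
    "larynx": 97,
    "pharynx": 39,
    "oral cavity": 23,
    "bladder": 18,
    "liver": 6,
}


def pack_years(packs_per_day, years):
    """One pack year is one pack per day for one year."""
    return packs_per_day * years


def expected_mutations(tissue, packs_per_day, years):
    """Linear extrapolation from the per-pack-year estimate (an assumption)."""
    return MUTATIONS_PER_PACK_YEAR[tissue] * pack_years(packs_per_day, years)


print(expected_mutations("lung", 1, 1))  # 150, the estimate quoted for lung
```

Under this linear assumption, half a pack a day for 20 years counts as 10 pack years and would correspond to roughly 1500 mutations in a lung cell.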

Consistent with our results, previous studies 
have reported higher numbers of total base sub- 
stitutions in lung adenocarcinoma in smokers 
versus nonsmokers (mainly due to C>A substitu- 
tions) (25, 26). The same is true of signatures 4 
and 5 in lung adenocarcinoma (18), signature 4
in liver cancer (27), and signature 5 in bladder 
cancer (24). 

Differential methylation of the DNA of normal 
cells of smokers compared to nonsmokers has 
been reported (28). Using data from methyl- 
ation arrays, each containing ~470,000 of the 
~28 million CpG sites in the human genome, 
we evaluated whether differences in methylation
are found in cancers. Overall levels of CpG
methylation in DNA from cancers were similar 
in smokers and nonsmokers for all cancer types 
(fig. S6). Individual CpGs were differentially 
methylated (>5% difference) in only two cancer 
types: 369 CpGs were hypomethylated and 65 
were hypermethylated in lung adenocarcinoma, 
with five hypomethylated and three hypermethyl- 
ated in oral cancer (Fig. 3 and fig. S7). CpGs 
exhibiting differences in methylation clustered 
in certain genes but were not associated with 
known cancer genes more than expected by 
chance, nor with genes hypomethylated in nor- 
mal blood or buccal cells of tobacco smokers 
(fig. S8 and tables S4 and S5) (28). Therefore, 
with the exception of lung cancer, CpG methyl- 
ation showed limited differences between the 
cancers of smokers and nonsmokers (Fig. 3). 
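The per-CpG comparison described above combines an effect-size cutoff (a methylation difference of more than 5%) with a multiple-testing correction. The sketch below illustrates one way to express this; it assumes a family-wise error rate of 0.05 divided by the approximate number of CpGs per array, and it takes the sign of the difference as smokers minus nonsmokers. The error rate, the sign convention, and the function names are assumptions, not details given in the text.

```python
N_CPGS_ON_ARRAY = 470_000  # approximate CpG sites per array (from the text)
ALPHA = 0.05               # assumed family-wise error rate (not stated in the text)
MIN_DIFFERENCE = 0.05      # the >5% methylation difference quoted in the text


def bonferroni_threshold(alpha=ALPHA, n_tests=N_CPGS_ON_ARRAY):
    """Per-test p-value cutoff after Bonferroni correction."""
    return alpha / n_tests


def classify_cpg(mean_smokers, mean_nonsmokers, p_value):
    """Label one CpG; methylation levels are fractions between 0 and 1."""
    difference = mean_smokers - mean_nonsmokers
    if p_value >= bonferroni_threshold() or abs(difference) <= MIN_DIFFERENCE:
        return "unchanged"
    return "hypermethylated" if difference > 0 else "hypomethylated"


print(f"{bonferroni_threshold():.2e}")  # on the order of 1e-7
print(classify_cpg(0.40, 0.50, 1e-9))
```

Requiring both conditions means that a CpG with a tiny but highly significant shift, or a large but poorly supported one, is not counted as differentially methylated.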
The genomes of smoking-associated cancers 
permit reassessment of our understanding of 
how tobacco smoke causes cancer. Consistent 
with the proposition that an increased mutation 
load caused by tobacco smoke contributes to increased
cancer risk, the total mutation burden
is elevated in smokers versus nonsmokers with 
lung adenocarcinoma, larynx, liver, and kidney 
cancers. However, differences in total mutation 
burden were not observed in the other smoking- 
associated cancer types and, in some, there were 
no statistically significant smoking-associated dif- 
ferences in mutation load, signatures, or DNA 
methylation. Caution should be exercised in the 
interpretation of the latter observations. In addi- 
tion to limitations of statistical power, multiple 
rounds of clonal expansion over many years are 
often required for development of a symptomatic 
cancer. It is thus conceivable that, in the normal 
tissues from which smoking-associated cancer 
types originate, there are more somatic muta- 
tions (or differences in methylation) in smokers 
than in nonsmokers but that these differences 
become obscured during the intervening clonal 
evolution. Moreover, some theoretical models pre- 
dict that relatively small differences in mutation 
burden caused by smoking in preneoplastic cells 
could account for the observed increases in cancer



Fig. 2. Mutational signatures associated with tobacco smoking. (A) Each 
panel contains 25 randomly selected cancer genomes (represented by in- 
dividual bars) from either smokers or nonsmokers in a given cancer type. The 
y axes reflect numbers of somatic mutations per megabase. Each bar is 
colored proportionately to the number of mutations per megabase attributed 
to the mutational signatures found in that sample. The naming of mutational 
signatures is consistent with previous reports (16-18). (B) Each panel con- 
tains the pattern of a mutational signature associated with tobacco smoking. 
Signatures are depicted using a 96-substitution classification defined by the
substitution type and sequence context immediately 5' and 3' to the mutated
base. Different colors are used to display the various types of substitutions. 
The percentages of mutations attributed to specific substitution types are on 
the y axes, whereas the x axes display different types of substitutions. Mu- 
tational signatures are depicted based on the trinucleotide frequency of the 
whole human genome. Signatures 2, 4, 5, 13, and 16 are extracted from can- 
cers associated with tobacco smoking. The signature of benzo[a]pyrene is 
based on in vitro experimental data (19). Numerical values for these mutational
signatures are provided in table S6.

Fig. 3. Differentially methylated individual CpGs in tobacco smokers across cancers associated
with tobacco smoking. Each dot represents an individual CpG. The x axes reflect differences in methylation
between lifelong nonsmokers and smokers, where positive values correspond to hypermethylation
and negative values to hypomethylation. The y axes depict levels of statistical significance. Results satisfying
a Bonferroni threshold of 10⁻⁷ (above the red line) are considered statistically significant.


risks (29). Other models indicate that differences 
in mutation burden between smokers and non- 
smokers need not be observed in the final cancers 
(supplementary text and fig. S6). Thus, increased 
somatic mutation loads in precancerous tissues 
may still explain the smoking-induced risks of 
most cancers, although other mechanisms have 
been proposed (30, 32). 

However, the generation of increased somatic 
mutation burden by tobacco smoking appears 
to be mechanistically complex. Smoking correlates 
with increases in base substitutions of multiple 
mutational signatures, together with increases 
in indels and copy-number changes. The extent 
to which these distinct mutational processes op- 
erate differs between tissue types (at least partially 
depending on the degree of direct exposure to 
tobacco smoke), and their mechanisms range 
from misreplication of DNA damage caused by 
tobacco smoke constituents to activation of more 
generally operative mutational processes. Although
we cannot exclude roles for covariate behaviors
of smokers or differences in the biology of can- 
cers arising in smokers compared with non- 
smokers, smoking itself is most plausibly the 
cause of these differences.

CHEMICAL BIOLOGY


A chemical biology route 
to site-specific authentic 
protein modifications 


Aerin Yang, Sura Ha, Jihye Ahn, Rira Kim, Sungyoon Kim, Younghoon Lee,
Jaehoon Kim, Dieter Söll, Hee-Yoon Lee,* Hee-Sung Park*


Many essential biological processes are controlled by posttranslational protein 
modifications. The inability to synthetically attain the diversity enabled by these 
modifications limits functional studies of many proteins. We designed a three-step 
approach for installing authentic posttranslational modifications in recombinant proteins. 
We first use the established O-phosphoserine (Sep) orthogonal translation system to 
create a Sep-containing recombinant protein. The Sep residue is then dephosphorylated to 
dehydroalanine (Dha). Last, conjugate addition of alkyl iodides to Dha, promoted by zinc 
and copper, enables chemoselective carbon-carbon bond formation. To validate our 
approach, we produced histone H3, ubiquitin, and green fluorescent protein variants with 
site-specific modifications, including different methylations of H3K79. The methylated 
histones stimulate transcription through histone acetylation. This approach offers a 
powerful tool to engineer diverse designer proteins. 


Posttranslational modifications (PTMs) play
vital roles in expanding protein functional
diversity and critically affect numerous biological
processes (1). The availability of proteins
with specific modifications at selected
residues is essential for experimental strategies to 
investigate fundamental biological mechanisms. 
Methods to generate diverse native protein cova- 
lent modifications do not exist at present. Genetic 
code expansion approaches are useful in producing 
recombinant proteins with specific modifications 
(2, 3) but rely on the availability of an orthogonal 
transfer RNA (tRNA) synthetase•tRNA pair for
acylation of a specific noncanonical amino acid. 
Despite much progress, the creation of many im- 
portant protein modifications (e.g., trimethyllysine) 
is not yet feasible. Among chemical conjugation 
approaches (4), Cys-based strategies have been 
widely applied to generate protein conjugates (5) 
and mimics of PTMs (6, 7). Yet the final products 
are PTM analogs whose value for finding uniden- 
tified properties of the natural system may be 
questionable (8). Thus, despite extensive efforts, 
synthetic approaches for many authentic PTMs 
are not available, because no carbon-carbon bond- 
forming reactions have thus far been success- 
fully applied to protein modifications, despite 
the prevalence of such reactions in organic chemistry
(9).

Here we propose a three-step strategy (Fig. 1A)
that, in principle, is applicable to generating diverse 
forms of authentic and selective protein modifica- 
tions. (i) The site of the intended modification is 
established by cotranslational O-phosphoserine 
(Sep) incorporation into a recombinant protein by 
using the Sep orthogonal Escherichia coli translation
system (10, 11). (ii) Then, the Sep residue of
the purified recombinant protein is converted by
phosphate removal to dehydroalanine (Dha), which
in turn serves as a radicalophile, enabling a bioorthogonal
chemical reaction. (iii) Last, PTM
moieties are directly coupled to Dha through metal- 
mediated conjugate additions of alkyl iodides in 
aqueous solution, facilitating chemoselective carbon- 
carbon bond formation in proteins (Fig. 1B). 

To demonstrate the feasibility and versatility 
of our strategy, we set out to generate recombinant 
Xenopus laevis histone H3K79 with five different 
modifications: monomethylation, dimethylation, 
trimethylation, formylation, and acetylation. Re- 
versible lysine methylation in proteins presents 
the most complex and dynamic modification (2). 
Several approaches have attempted lysine methylation
(13), but the trimethylated product has
proved elusive. Histone H3K79 appears to be dynamically
regulated (14) and associated with diverse
cellular processes; its exact role has not been fully 
examined because such authentically modified 
histones have not been generated. 

First, the Sep-containing histone H3Sep79 was
made by expressing a X. laevis histone H3
mRNA containing a UAG codon at position 79
and a coding sequence for a C-terminal His6 tag
in E. coli containing the orthogonal Sep translation
system [the engineered SepRS9•tRNASep
pair and the evolved elongation factor (EF)-Sep21]
(11). H3Sep79 was routinely obtained in good
yield (~20 mg/liter of culture) (fig. S1). The incorporation
of Sep at the intended position (79)
was confirmed by matrix-assisted laser desorption/ionization
time-of-flight mass spectrometry (MALDI-TOF
MS) analysis of the purified recombinant
histone (fig. S2) and its tryptic peptides (Fig. 2A
and tables S1 and S2).

Because phosphoamino acids are known to be 
labile under alkaline conditions (15), we attempted

Fig. 1. Approach for protein chemical modifications. (A) Schematic representation of the three-
step synthesis of proteins with authentic PTMs. (B) Our coupling scheme enables chemoselective
carbon-carbon bond formation in proteins. R, PTM moiety (e.g., R1 = monomethyllysine,
R2 = dimethyllysine, R3 = trimethyllysine, R4 = formyllysine, R5 = acetyllysine).
Panel labels: PTM moiety attachment; dehydroalanine; PTM modified.

Fig. 2. Synthesis of proteins with selective differential methylations. (A) MALDI-TOF MS analysis of proteins after trypsin digestion. Formation of
H3Sep79 is evidenced by the Sep-containing peptide (Sep79, blue). Synthesis of H3Dha79 generates a new Dha-containing peptide (Dha79, green) but eliminates the
Sep-containing peptide. Coupling of the methyl iodides onto H3Dha79 produces new methylated lysine-containing peptides (K79me1, K79me2, and K79me3, red).
m/z, mass/charge ratio; [M+H]+, monoisotopic mass; a.u., arbitrary units. (B) Western blot analysis of the modified proteins, using antibodies (α) against H3K79me1,
H3K79me2, and H3K79me3. CB, Coomassie blue. (C) Chemical structures of lysine and differentially methylated lysine residues.


to convert Sep to Dha by mild alkali treatment.
To determine the optimal procedure, purified
H3Sep79 was incubated with LiOH, Ba(OH)2,
or Sr(OH)2 under different conditions, followed
by neutralization with acetic acid and dialysis
against distilled water (fig. S3). The reaction progress
was analyzed by MALDI-TOF MS after in-gel
trypsin digestion (tables S1 and S2). Near-complete
disappearance of the Sep residue and concurrent
generation of Dha took 30 min at room temperature
in 40 mM Ba(OH)2 solution. Mass analysis
of whole proteins (fig. S4) and tryptic peptides
(Fig. 2A) demonstrated that these conditions led
to highly selective chemical transformation of
H3Sep79 into H3Dha79 with no noticeable protein
oxidation or other side reactions (16).
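The conversions monitored by mass spectrometry correspond to predictable mass shifts at the modified residue. The following sketch computes them from elemental compositions; the monoisotopic atomic masses and residue formulas are standard chemistry values supplied here rather than numbers taken from the text, and the names are illustrative. Trimethyllysine is left out because its fixed positive charge changes how the observed ion mass is calculated.

```python
# Monoisotopic atomic masses in Da (standard values, not from the text).
ATOMIC_MASS = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146, "P": 30.9737620}

# Residue compositions (free amino acid minus one water), as present within a peptide chain.
RESIDUES = {
    "Sep": {"C": 3, "H": 6, "N": 1, "O": 5, "P": 1},  # O-phosphoserine
    "Dha": {"C": 3, "H": 3, "N": 1, "O": 1},          # dehydroalanine
    "Kme1": {"C": 7, "H": 14, "N": 2, "O": 1},        # monomethyllysine
    "Kme2": {"C": 8, "H": 16, "N": 2, "O": 1},        # dimethyllysine
}


def monoisotopic_mass(formula):
    """Sum atomic masses for a composition given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def mass_shift(before, after):
    """Mass change (Da) when residue `before` is converted to residue `after`."""
    return monoisotopic_mass(RESIDUES[after]) - monoisotopic_mass(RESIDUES[before])


for step in [("Sep", "Dha"), ("Dha", "Kme1"), ("Dha", "Kme2")]:
    print(step, round(mass_shift(*step), 3))
```

The Sep-to-Dha step loses the elements of phosphoric acid (about 98 Da), and each additional methyl group adds about 14 Da, which is the spacing expected between the differently methylated peptides.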
Dha-mediated conjugation approaches have 
been successfully applied to the generation of 
thiol-linked PTM analogs or mimics (17, 18). To 
produce authentic PTMs from Dha, we needed a 
new coupling scheme enabling chemoselective 
carbon-carbon bond formation in proteins (Fig. 
1B). We were encouraged by reports of water-based
organic reactions in which an alkene group
functions as a radical acceptor (19). Knowing that
Dha could function as a radical acceptor, we chose 
alkyl radicals that can be generated from alkyl 
iodides by transition metals (19). Inspired by re- 
cent metal-mediated conjugate addition reactions 
(20), we reasoned that organozinc species would 
form from the precursor halides and Zn metal. Sub- 
sequently, zinc-to-copper transmetalation would 
occur, generating organocopper reagents, which 
would lead to the formation of alkyl radical species. 
Last, conjugate coupling of the alkyl radical with 
the Dha residue of a protein would take place 
(Fig. 1B). To demonstrate the proposed coupling 
scheme, we first tested the addition of trimethyl
iodide (3-iodo-N,N,N-trimethylpropan-1-amine,
3; fig. S5) to H3Dha79 to generate the expected
product H3K79me3. The reaction products were 
analyzed by Western blotting using an antibody 
against H3K79me3. Our initial attempts to di- 
rectly use water-based organic reactions were un- 
successful. After testing a large array of conditions 
(described in the supplementary materials, figs. 
S6 to S16)—including different buffers and pH 
ranges, surfactants for protein stabilization, essential 


reagents for the metal-mediated coupling reaction 
(Zn metals and copper salts), and possible auxi- 
liary reagents [e.g., tetramethylethylenediamine 
(TMEDA)]—we arrived at conditions that repro- 
ducibly led to the formation of carbon-carbon 
bonds in high yield. The deduced optimal conditions
were H3Dha79 (10 µM), alkyl iodide (30 mM),
Zn powder (0.4 mg), Cu(OAc)2 (1 mM), Triton
X-100 (2.0 weight %), and TMEDA (10 mM) in
sodium acetate (pH 4.5, 0.5 M). Reactions (20 to
50 µl) were incubated at room temperature.
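To put the reaction conditions in perspective, the amounts of protein and reagent per reaction can be worked out from the stated concentrations and volumes. This is an illustrative calculation only: it reads the protein concentration as 10 µM (the unit is garbled in the source text), and the constant and function names are hypothetical.

```python
PROTEIN_MOLAR = 10e-6       # H3Dha79, read here as 10 uM (assumption)
ALKYL_IODIDE_MOLAR = 30e-3  # alkyl iodide, 30 mM
TMEDA_MOLAR = 10e-3         # TMEDA, 10 mM
COPPER_MOLAR = 1e-3         # copper(II) acetate, 1 mM


def nanomoles(molar, volume_ul):
    """Amount in nmol: (mol/L) * (volume in uL * 1e-6 L/uL) * 1e9 nmol/mol."""
    return molar * volume_ul * 1e3


def molar_excess(reagent_molar, protein_molar=PROTEIN_MOLAR):
    """Ratio of reagent to protein, independent of reaction volume."""
    return reagent_molar / protein_molar


for volume_ul in (20, 50):
    print(
        volume_ul, "ul:",
        round(nanomoles(PROTEIN_MOLAR, volume_ul), 3), "nmol protein,",
        round(nanomoles(ALKYL_IODIDE_MOLAR, volume_ul), 1), "nmol alkyl iodide",
    )
print("alkyl iodide excess over protein:", round(molar_excess(ALKYL_IODIDE_MOLAR)))
```

On these figures the alkyl iodide is present in a large molar excess over the protein (on the order of thousands-fold), while each reaction contains well under a nanomole of protein.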
With optimized reaction conditions in hand, we 
attempted to synthesize H3K79me1, H3K79me2,
and H3K79me3 by incubating H3Dha79 with three
different methyl iodides: monomethyl iodide
(3-iodo-N-methylpropan-1-amine, 1), dimethyl iodide
(3-iodo-N,N-dimethylpropan-1-amine, 2), or trimethyl
iodide (Fig. 2C and fig. S5). MALDI-TOF
MS analysis revealed the disappearance of Dha 
and the concomitant generation of methylated 
lysine residues (Fig. 2A). The coupling reactions 
were highly selective and efficient (normally >80%), 
as demonstrated by mass analyses of tryptic peptides
(Fig. 2A and table S2) and whole proteins

Fig. 3. Effects of H3K79 methylation on chromatin transcription. (A) Schematic of the standard in vitro transcription assay. ACCoA, acetyl coenzyme A;
NTP, nucleoside triphosphate. (B) Effect of methylated H3K79 on chromatin transcription (x axis, lane number; y axis, relative transcription activity). (C)
Effect of methylated H3K79 on p300-mediated chromatin acetylation. Histone acetylation status was monitored by fluorography. (D) Schematic representation
of transcriptional activation by H3K79 methylation. Pol, polymerase; GTF, general transcription factor; TBP, TATA-binding protein.


(fig. S17), and had a good recovery yield (between 
50 and 70%). Selective and differential lysine 
methylations were also demonstrated by Western 
blot analysis using antibodies against H3K79me1,
H3K79me2, and H3K79me3 (Fig. 2B). To deter- 
mine whether other PTM moieties could be cou- 
pled onto H3Dha79, we synthesized formyl iodide 
[N-(3-iodopropyl)formamide, 4] and acetyl iodide 
[N-(3-iodopropyl)-acetamide, 5; fig. S5] and used 
them for producing histones H3K79Nε-formyl and
H3K79Nε-acetyl, respectively, as verified by MALDI-TOF
MS analysis (fig. S18).

To demonstrate whether our three-step synthe- 
sis can be used for modification of other proteins, 
we set out to generate ubiquitin variants with 
site-specific modifications. Eight ubiquitin variants 
with various lysine modifications (acetylation and dif- 
ferential methylations) at position 33 or 48 were ef- 
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ficiently generated (fig. S19 and tables S3 to S5). Also, 
we found that different alkyl iodides (iodoethane, 
6; 2-iodopropane, 7; 2-iodo-2-methylpropane, 8; 1- 
jodobutane, 9; iodocyclopentane, 10; 3-iodopronionic 
acid, 11; and tert-butyl-iodobutoxydimetylsilane, 
12) (fig. S5) were efficiently conjugated to Ub33Dha 
(fig. S20 and table S4). In particular, coupling 
of dansyl iodide (5-{dimethylamino}-N-{2-[(3- 
jodopropyl)amino Jethyl}naphthalene-1-sulfonamide, 
13; fig. S5) onto Ub33Dha led to site-specific flu- 
orescent dye labeling via carbon-carbon bond 
formation (figs. S20 and S21 and table S4). Last, 
site-specific Lys204 trimethylation of green fluorescent 
protein (GFP) by our three-step synthetic 
route led to the desired product GFPK204me3 
but with lower conversion efficiency (~20%) and 
recovery yield (~30%) compared with histone H3 
and ubiquitin (fig. S22 and tables S6 and S7). 
Thus, the structural context within the protein 
may affect the ease of modification. These data 
show that our approach is applicable to a variety of 
proteins. Because each protein behaves differently 
with the diverse chemical reagents, individual- 
ized optimization should lead to efficient imple- 
mentation of our synthetic strategy with other 
proteins. 

Next, we performed biochemical assays with 
the modified histones generated by our three-step 
approach to examine whether they are fully func- 
tional in the biological context. Genome-wide 
analyses of chromatin have revealed that H3K79 
methylation is enriched in actively transcribing 
regions (14), but its detailed role is not yet well 
understood. A transcription assay using in vitro 
assembled chromatin templates and a highly puri- 
fied RNA polymerase II transcription apparatus 
provides a biochemically defined system to study 
the functions of individual histone modifications. 
The modified histones H3K79me1, H3K79me2, 
and H3K79me3 were first assembled with H2A, 
H2B, and H4 to form histone octamers. The methylated 
histones showed no noticeable difference 
from the unmodified (intact) histone during oc- 
tamer assembly and nucleosome reconstitution 
(fig. S23). Recombinant chromatin templates were 
reassembled with the histone octamers and a 
p53ML plasmid and were applied to p53 (activator)- 
and p300 (coactivator)-dependent in vitro transcription 
assays (Fig. 3A) (21). Intact chromatin 
lacking methylation showed a low level of tran- 
scription only in the presence of both the activator 
and coactivator, as expected (Fig. 3B). Notably, 
the levels of basal (activator- and/or coactivator- 
independent) transcription from chromatins with 
all three methylation states were greatly enhanced 
(Fig. 3B). Also, activator- or coactivator-dependent 
transcription from H3K79-methylated chromatin 
was elevated (Fig. 3B), demonstrating a direct 
stimulatory effect of H3K79 methylation on chro- 
matin transcription. More importantly, we also 
found that H3K79 methylation increased histone 
acetylation mediated by p300 (Fig. 3C). Histone 
acetylation was differentially affected by the level 
of H3K79 methylation (Fig. 3C). Thus, H3K79 
methylation is indeed positively associated with 
transcription activation through p300-mediated 
histone acetylation, which is differently affected 
by the various methylation states (Fig. 3D). These 
results illustrate that the methylated histones are 
fully active and, depending on the modification 
level, functionally distinct, demonstrating the util- 
ity and gentle nature of our three-step approach. 

An open question concerns the diastereose- 
lectivity of our coupling reaction. It is known 
that radical and thiol conjugate additions to Dha 
lead to epimeric mixtures (22-24); a definitive 
analysis of the epimeric ratio will require high- 
resolution crystallography of the protein. However, 
the steric context of the local protein conformation 
may substantively affect the final diastereomeric 
ratio of the products. The fact that our synthetic 
proteins H3K79me1, H3K79me2, and H3K79me3 
were well recognized by antibodies, and that these 
modified histones could assemble into octamers 
with biological in vitro activity, underscores the 
utility of our approach. 

We anticipate that with well-tailored alkyl 
iodides [e.g., (25)], our approach can be extended 
to produce designer proteins with diverse forms 
of chemical modifications (e.g., glycosylated amino 
acids and phosphotyrosine). Such efforts will 
markedly expand the available chemical diversity 
in proteins and facilitate the study of many PTM- 
mediated biological processes. 
OCEAN CHEMISTRY 


Ocean mixing and ice-sheet 
control of seawater ²³⁴U/²³⁸U 
during the last deglaciation 


Tianyu Chen, Laura F. Robinson, Matthew P. Beasley, Louis M. Claxton, 
Morten B. Andersen, Lauren J. Gregoire, Jemma Wadham, 
Daniel J. Fornari, Karen S. Harpp 


Seawater ²³⁴U/²³⁸U provides global-scale information about continental weathering and is 
vital for marine uranium-series geochronology. Existing evidence supports an increase in 
²³⁴U/²³⁸U since the last glacial period, but the timing and amplitude of its variability have 
been poorly constrained. Here we report two seawater ²³⁴U/²³⁸U records based on 
well-preserved deep-sea corals from the low-latitude Atlantic and Pacific Oceans. The 
Atlantic ²³⁴U/²³⁸U started to increase before major sea-level rise and overshot the modern 
value by 3 per mil during the early deglaciation. Deglacial ²³⁴U/²³⁸U in the Pacific 
converged with that in the Atlantic after the abrupt resumption of Atlantic meridional 
overturning. We suggest that ocean mixing and early deglacial release of excess ²³⁴U from 
enhanced subglacial melting of the Northern Hemisphere ice sheets have driven the 
observed ²³⁴U/²³⁸U evolution. 


The last deglaciation [18.0 to 10.5 thousand 
years ago (ka)] saw massive changes in 
Earth’s surficial environments, including 
in temperature and precipitation, as well as 
the retreat of the Northern Hemisphere (NH) 
ice sheets and sea-level rise (1, 2). These processes 
have the potential to induce large variability in the 
weathering of the continents and the composition 
of chemical fluxes to the ocean. The ratio ²³⁴U/²³⁸U 
is one of the isotopic signatures with the 
potential to have recorded global changes in con- 
tinental weathering during this critical climate 
transition (3). 
The activity ratio of ²³⁴U to ²³⁸U in modern 
seawater is ~15% higher than secular equilibrium 
(4) because of the relatively mobile nature of ²³⁴U 
induced by α-recoil effects (5) in the weathered 
host rocks. ²³⁴U is enriched relative to ²³⁸U at 
particle boundaries or damaged lattices and is 
expected to be preferentially released in the initial 
phases of weathering (6). Leaching experiments 
(7) indicate that the early fraction of granite 
leachates is high in both U concentration and 
δ²³⁴U {δ²³⁴U = [(²³⁴U/²³⁸U)activity ratio − 1] × 1000}. 
In the last glacial period, subglacial drainage of 
meltwaters from a limited area of ice-sheet inte- 
rior to the margins may have been possible (8), 
analogous to the Antarctic Ice Sheet today, where 
basal meltwater is routed to the margins via sub- 
glacial channels (9). Nevertheless, a large fraction 
of the glaciated NH continents likely had very 
limited chemical weathering flux to the ocean 
because of widespread freezing conditions at the 
ice-sheet base (10). It is reasonable to assume that 
a labile pool of excess ²³⁴U due to α-recoil would 
have accumulated in the frozen sediments or isolated 
subglacial lakes and ponds in the wet-based 
zones under these ice sheets; in fact, high dissolved 
δ²³⁴U of up to ~4000 per mil (‰) has been 
observed in the Antarctic Taylor Valley (11), a region 
thought to be hydrologically connected to 
the nearby ice sheets (12). In the nonglaciated 
regions, δ²³⁴U released by weathering would also 
respond to tectonic and precipitation changes (3). 
A reliable reconstruction of oceanic δ²³⁴U thus 
offers a potentially important means for tracing 
global-scale weathering variability during climate 
transitions. 
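
The δ²³⁴U notation defined above can be illustrated with a short Python sketch. The function names and the example activity ratio of 1.147 are illustrative assumptions (the text states only that modern seawater is ~15% above secular equilibrium); they are not taken from the study.

```python
def delta_234u(activity_ratio: float) -> float:
    """Convert a 234U/238U activity ratio to delta-234U in per mil."""
    return (activity_ratio - 1.0) * 1000.0


def activity_ratio_from_delta(delta_permil: float) -> float:
    """Inverse conversion: delta-234U in per mil back to an activity ratio."""
    return 1.0 + delta_permil / 1000.0


# Secular equilibrium corresponds to an activity ratio of exactly 1.
print(delta_234u(1.0))

# Hypothetical seawater-like activity ratio, roughly 15% above equilibrium.
print(round(delta_234u(1.147), 1))

# A 3 per mil shift in delta-234U is a change of 0.003 in the activity ratio.
print(round(activity_ratio_from_delta(150.0) - activity_ratio_from_delta(147.0), 6))
```

On this scale, the per mil differences discussed in the text correspond to changes in the third decimal place of the activity ratio.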

The long residence time of U in seawater 
[~400 thousand years (ky) (13, 14)] would lead to 
the expectation that any equilibrium response to 
external inputs should be more than an order of 
magnitude longer than the deglacial time scale 
(~10 ky). However, a growing number of studies 
have indicated that seawater δ²³⁴U might have 
been lower during the last glacial period than 
during the Holocene and previous interglacials 
(3, 15-17) and might have also changed on millennial 
time scales (18). These observations imply 
that there have been large, relatively rapid changes 
in the U isotope budget of the ocean and are 
supported by an updated compilation of published 
initial δ²³⁴U data from corals for the last 
30 ka (Fig. 1A). However, the extensive scatter in 
the data, which is likely due to diagenesis (19), has 
limited the ability to constrain the timing and 
magnitude of δ²³⁴U variations through time. 
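
The residence-time argument above can be made concrete with a rough calculation. The sketch below assumes a single well-mixed reservoir that relaxes exponentially toward a new steady state after a step change in inputs; this first-order behavior, the function name, and the 40-ky comparison value are simplifying assumptions for illustration, not part of the study.

```python
import math


def fraction_of_adjustment(elapsed_ky: float, residence_time_ky: float) -> float:
    """Fraction of the way to a new steady state reached after elapsed_ky,
    assuming first-order (exponential) relaxation of a well-mixed reservoir."""
    return 1.0 - math.exp(-elapsed_ky / residence_time_ky)


# Whole-ocean case from the text: ~400 ky residence time, ~10 ky deglaciation.
whole_ocean = fraction_of_adjustment(10.0, 400.0)
print(f"{whole_ocean:.3f}")

# Hypothetical shorter effective residence time (40 ky) for comparison.
isolated_box = fraction_of_adjustment(10.0, 40.0)
print(f"{isolated_box:.3f}")
```

Under these assumptions the whole ocean completes only a few percent of its adjustment within the deglacial interval, which is why rapid δ²³⁴U changes call for either large input perturbations or a reservoir with a shorter effective residence time.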

To put more robust limits on seawater δ²³⁴U 
evolution and infer past changes in chemical 
weathering, we report two well-constrained U 
isotope records for the past 30 ka based on well-preserved 
deep-sea corals recovered from the low-latitude 
North Atlantic and the Pacific Galapagos 
platform (Fig. 1B and figs. S1 and S2), with additional 
samples for reference from 50 to 30 ka 
(19) (fig. S3). The general trend of δ²³⁴U evolution 
in these records agrees with previous studies 
(3, 15-17), although the newly sampled glacial-period 
deep-water corals tend to exhibit higher 
initial δ²³⁴U than the surface corals in each ocean 
basin (fig. S3). Our data thus suggest a somewhat 
smaller glacial-Holocene δ²³⁴U difference of only 
3 to 4‰ (Fig. 1). Atlantic δ²³⁴U started to increase 
around 22 to 20 ka (Fig. 1B), followed by 
a rapid increase of ~6‰ up to 150‰ during 
Heinrich Stadial 1 (HS1, 18.0 to 14.6 ka). Available 
Pacific data are less well resolved, but the δ²³⁴U 
values are lower than those in the Atlantic during 
early HS1. The δ²³⁴U in the Atlantic and Pacific 
records converged to the modern level during 
the Bølling-Allerød (B-A, 14.6 to 12.9 ka). Before 
the last glacial maximum (25 to 50 ka), our data 
set is consistent with a lower δ²³⁴U than that of 
the Holocene (fig. S3), but it is not well enough 
resolved to identify potential millennial-scale 
changes (18). 
There are several possible causes for the observed 
deglacial increase in seawater δ²³⁴U. Coastal 
regions have been inferred to retain U with 
high δ²³⁴U during sea-level highstands (16). Redissolution 
of coastal U has been hypothesized 
to drive increases in δ²³⁴U when sea level rises 
(16). If that is the case, an increase in seawater 
δ²³⁴U should closely follow sea-level rise. The 
increase in Atlantic δ²³⁴U does appear to have 
coincided with the initiation of sea-level rise 
(Fig. 2), but most of the increase in δ²³⁴U occurred 
before the major sea-level rise in the late 
deglaciation. Hence, we suggest that although 
U stored in coastal areas could be important for 
δ²³⁴U variability in other situations (18), it was 
probably not the main driver of the last deglacial 
δ²³⁴U evolution. An increase in deep-water oxygenation 
may release redox-sensitive U from the 
seafloor sediments, possibly affecting the seawater 
U budget during deglaciation. The early deglaciation 
is thought to have had lower oxygen concentrations 
than those of the modern day in the 
upper ocean (<1.5 km) (20). Increased bottom-water 
oxygenation in the Atlantic and deep Pacific 
only occurred after HS1 (20) with the resumption of 
North Atlantic deep overturning (21), which occurred 
later than the observed increase in δ²³⁴U. 
Together, these results suggest that external sources 
of excess ²³⁴U or ocean mixing have to be involved 
to explain the observed δ²³⁴U variability. 

Models (22) and proxies such as ²³¹Pa/²³⁰Th 
(21, 23) support a reduced deep Atlantic merid- 
ional overturning circulation (AMOC) (Fig. 2), as 
well as a reduced surface Gulf Stream (24) and 
Agulhas leakage (25), during HS1. These pro- 
cesses likely resulted in an upper Atlantic that 
was more isolated from the rest of the ocean than 
it is today (19). Increased isolation of the upper 
Atlantic (~2.0 km, including the depth range of 


Fig. 1. Seawater δ²³⁴U evolution over the 
past 30 ky, reconstructed from corals. 
(A) Compilation of published coral initial δ²³⁴U 
(δ²³⁴Ui) within the range of 135 to 155‰ and 
with 2σ errors < 3‰ (19). Data outside of 
this range have been truncated. (B) Low-latitude 
North Atlantic records with ±2σ confidence 
lines (black; the green line shows the 
mean). Also shown are the initial δ²³⁴U values 
for the low-latitude Pacific, reconstructed 
from Galapagos deep-sea corals (the gold line 
shows the mean, with the dotted section 
highlighting the data gap in HS1). One 
Galapagos coral data point from the B-A with 
an initial δ²³⁴U higher than 155‰ has likely 
experienced diagenesis and is not shown (19). 
Black dashed lines mark the modern seawater 
signature. 
the deep-sea corals in this study) would act to 
reduce the effective U residence time and allow 
its δ²³⁴U to change more rapidly than in the 
Pacific. We applied a two-box model, consisting 
of an upper Atlantic-Arctic box and a “rest of the 
ocean” box, to study the influence of changing 
external sources and changing ocean circulation 
on seawater δ²³⁴U (19). Modern high-latitude riverine 
inputs have considerably higher δ²³⁴U than 
middle- to low-latitude inputs and mainly supply 
the Arctic and polar North Atlantic (table S1) (19). If 
the δ²³⁴U and U fluxes of all external sources are 
kept constant throughout the past 25 ky, our 
model shows that a slowdown of ocean circulation 
during HS1 can result in a resolvable difference 
in δ²³⁴U between the upper Atlantic and the rest of 
the ocean, depending on the degree of reduction in 
exchange flux (Fig. 3A). This result is consistent 
with the difference in δ²³⁴U between the Pacific 
and Atlantic records (Fig. 1) and contrasts with 
the general assumption that U isotopes are homogeneous 
throughout the global ocean [i.e., differences 
no larger than 0.4‰ (4)]. Our result also 
raises the possibility that other isotope systems 
with relatively long residence times might exhibit 
differences between different ocean basins during 
periods of reduced ocean mixing. 
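
The qualitative behavior described above can be reproduced with a toy two-box calculation. The sketch below is not the published model (19): every inventory, flux, and input δ²³⁴U value is a hypothetical placeholder, each box is assumed to hold a constant U inventory (removal balances input without fractionation), excess ²³⁴U decays with a half-life of roughly 245 ky, and the equations are integrated with a simple forward-Euler step.

```python
import math

# All values below are illustrative assumptions, not the published parameters.
LAMBDA_234 = math.log(2) / 245.0       # 234U decay constant per ky (approximate)
N_UPPER, N_REST = 1.0, 9.0             # relative U inventories of the two boxes
F_UPPER, F_REST = 0.010, 0.015         # external U input per box (inventory units per ky)
D_IN_UPPER, D_IN_REST = 500.0, 200.0   # delta-234U of the external inputs (per mil)


def step(d_upper, d_rest, exchange, dt):
    """Advance both boxes by one forward-Euler step of length dt (ky)."""
    dd_upper = (F_UPPER * (D_IN_UPPER - d_upper)
                + exchange * (d_rest - d_upper)) / N_UPPER - LAMBDA_234 * d_upper
    dd_rest = (F_REST * (D_IN_REST - d_rest)
               + exchange * (d_upper - d_rest)) / N_REST - LAMBDA_234 * d_rest
    return d_upper + dt * dd_upper, d_rest + dt * dd_rest


def run(d_upper, d_rest, exchange, duration_ky, dt=0.05):
    """Integrate the two-box system for duration_ky at a fixed exchange flux."""
    for _ in range(int(round(duration_ky / dt))):
        d_upper, d_rest = step(d_upper, d_rest, exchange, dt)
    return d_upper, d_rest


# Spin up to steady state with full exchange, then halve the exchange flux
# for 3.4 ky (the length of HS1, 18.0 to 14.6 ka).
spun_up = run(150.0, 150.0, exchange=1.0, duration_ky=4000.0)
slowed = run(*spun_up, exchange=0.5, duration_ky=3.4)

print("offset with full exchange:", round(spun_up[0] - spun_up[1], 2))
print("offset after the slowdown:", round(slowed[0] - slowed[1], 2))
```

With these placeholder values, reducing the exchange flux widens the δ²³⁴U offset between the upper box and the rest of the ocean while external inputs stay fixed, which is the sense of the sensitivity described in the text; the magnitudes themselves carry no meaning.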

Ocean circulation, however, cannot account for 
the overall glacial-interglacial increase of ~3‰ or 
more in the δ²³⁴U of both ocean basins (Fig. 3A). 
External ²³⁴U inputs must have increased relative 
to ²³⁸U during the early deglaciation. With three 
times the modern riverine U flux, a 3‰ shift in 
oceanic δ²³⁴U is possible during HS1 (fig. S4). 
However, given that the hydrological cycle was 
probably weaker during the late last glacial period 
and early deglaciation than at present (2), increased 
dissolved U flux from global rivers alone 
as a driver for the δ²³⁴U change is unlikely. An 
increase in δ²³⁴U in the continental inputs is 
therefore likely to have been important in seawater 
δ²³⁴U evolution. There are no direct measurements 
of the δ²³⁴U of past watersheds, so we 
compiled δ²³⁴U data from speleothems that grew 
during the relevant time period. Although speleothem 
δ²³⁴U does not necessarily reflect the primary 
surface-weathering signal, because it includes 
influences such as the percolation of meteoric 
waters from the surface down into the cave, it may 
still provide a first-order indication of the variability 
in the hydrological cycle that controlled the 
overall ²³⁴U budget available for weathering (26). 
The data come from the middle to low latitudes 
(fig. S5), and they do not exhibit any distinctive 
δ²³⁴U shifts from 30 to 15 ka, suggesting that 
weathering variability in these areas was not 
large enough to account for the oceanic δ²³⁴U 
observations. 

Instead, high-latitude processes could have 
played a key role in driving the increase in global 
seawater δ²³⁴U from the last glacial period to 
15 ka. In the early part of the last deglaciation, the 
bases of North American ice sheets are thought to 
have become increasingly wet (10). We hypothesize 
that ²³⁴U-enriched water was released from subglacial 
melt reservoirs with a prolonged residence 
time and from leaching of the previously frozen 
Fig. 2. Seawater δ²³⁴U evolution compared with other climate records. (A) Modeled retreat rate 
of the Northern Hemisphere (NH) and Southern Hemisphere (SH) ice sheets (33). (B) Modeled basal 
melting rate of the North American ice sheets (28) and ice discharge of the Laurentide Ice Sheet (29). 
(C) ²³¹Pa/²³⁰Th from North Atlantic deep sediment cores (21, 23). (D) Sea-level history (34). (E) Our 
reconstructed δ²³⁴U evolution in the upper Atlantic and Pacific. Green solid line, Atlantic mean; green 
dashed line, Atlantic 2σ error; gold line, Pacific mean (dotting, HS1 data gap); gold shading, estimated 
range of Pacific δ²³⁴U; black dotted line, modern seawater δ²³⁴U; Gt, gigaton; y, year. 


subglacial sediments by basal meltwater over 
this period. In both cases, subglacial meltwaters 
are likely to be enriched in recoiled ²³⁴U (7, 11). 
In addition, sediments (and their former porewaters) 
frozen within the bases of icebergs might 
also have contributed to releasing U and nutrients 
to the ocean (27) during this period. The 
reconstructed number of ice streams based on 
field evidence (8) and the modeled subglacial 
melt rate (10, 28) (Fig. 2B) of the North American 
ice sheets were much higher during the 
early deglaciation (before 15 ka) than later. It is 
notable that the modeled ice discharge (29) of 
the Laurentide Ice Sheet was also high during the 
early deglaciation, reaching a peak at late HS1 
(Fig. 2). This timing is consistent with the enhanced 
release and transport of excess ²³⁴U to 
the ocean during the early deglaciation. In this 
case, the peak oceanic δ²³⁴U indicates that basal 
water from the whole ice-sheet interior may have 
been active in exporting to the margin during 
HS1. In another modeling experiment (19), the 
sensitivity of oceanic responses to inputs with 
different δ²³⁴U values was tested. With an average 
δ²³⁴U of ~800‰ for the input to the upper 
Atlantic during HS1, the model is able to reproduce 
the observed amplitude of the increase in 
δ²³⁴U from the last glacial period to 15 ka (Fig. 
3B). A more realistic case might be an increase 
in both the U flux and the δ²³⁴U of the high-latitude 
continental inputs, but deconvolving 
these two factors is difficult. There is a hint from 
the compiled data that the surface ocean had 
lower δ²³⁴U than the intermediate-depth ocean 
(fig. S3). Our extended modeling experiments 
(table S2) are unable to replicate this feature, 
even with all high-latitude excess ²³⁴U routed to 
the intermediate ocean via the polar surface 
Atlantic and no deep-ocean overturning and mixing 
(19). This indicates that other mechanisms, 
such as local influences or diagenetic processes, 
are responsible for those differences. 
Fig. 3. Sensitivity experiment assessing the response of seawater δ²³⁴U to ocean circulation and external inputs. (A) Effect of ocean mixing slowdown 
alone (Sv, sverdrup). (B) Effect of variable δ²³⁴U in inputs to the upper Atlantic, applying a 50% reduction of the glacial exchange flux during HS1. The area 
between the dashed-dotted curves is the range of low-latitude Atlantic δ²³⁴U. External U fluxes through time in both experiments are kept at the modern inputs. 
The spectrum of colors denotes different experiments (solid lines, upper Atlantic; dashed lines, other ocean). Further details are in (19). 


The transition from HS1 to the B-A is marked 
by a distinct decrease in δ²³⁴U in the low-latitude 
Atlantic by ~3‰ from about 150‰ to the modern 
seawater signature. The Pacific δ²³⁴U appears to 
have risen at the same time, converging with the 
Atlantic δ²³⁴U during the mid-B-A. We suggest 
that this convergence is due to the abrupt increase 
in overturning of the Atlantic at the end 
of HS1 (21, 23) (Figs. 2C and 3A), although the 
depletion of the subglacial excess ²³⁴U pool might 
also have played a role (19). Stabilized Holocene 
seawater δ²³⁴U afterward implies that the U cycle 
in the modern ocean is likely close to a steady 
state (30). Nevertheless, perturbation of oceanic 
U isotopes by polar processes might have continued 
to occur even during the climatically and 
oceanographically more stable late Holocene. For 
example, widespread collapse of the Ross Ice Shelf 
and the export of old materials from inland Antarctica 
are inferred to have occurred at ~5 to 1.5 ka 
in response to regional warming (31). These processes 
may have been accompanied by a ²³⁴U-rich 
flux to the Southern Ocean, although any shift 
in the whole-ocean δ²³⁴U was probably limited. 
On longer orbital time scales, seawater δ²³⁴U is 
thought to have been higher during past interglacial 
periods and lower during glacial periods [e.g., 
(3, 16, 17)], with the decreases potentially associated 
with low sea level (16). By comparison 
with the deglacial mechanism, our study implies 
that progressive NH glaciation could have reduced 
the weathering input from high-latitude continental 
areas, leading to lower glacial oceanic δ²³⁴U. 

The retreat of the NH ice sheets started about 
20 ka and continued through HS1 (Fig. 2A) (1), 
considerably earlier than the enhanced surface 
melting that dominated ice-sheet mass loss and 
sea-level rise in the late deglaciation and early 
Holocene (29). Our data provide evidence for en- 
hanced subglacial melting of the NH ice-sheet 
interior during the early deglaciation, supporting 
the notion that basal melting and sliding represent 
one of the feedbacks involved in enhancing 
early deglaciation as a result of the buildup of 
very large NH ice sheets (10). An interesting consequence 
of the basal melt inputs may be the associated 
release of nutrients to the ocean. Recent 
work from the Greenland Ice Sheet indicates that 
dissolved phosphorus yields are at least equal to 
those from the Mississippi or Amazon rivers (32). 
In this regard, nutrients from direct subglacial 
weathering should be considered in future research 
as a potential source fueling productivity in the 
North Atlantic during HS1. 
NEUROREGENERATION 


Injury-induced ctgfa directs glial 
bridging and spinal cord 
regeneration in zebrafish 


Mayssa H. Mokalled, Chinmoy Patra, Amy L. Dickson, Toyokazu Endo, 
Didier Y. R. Stainier, Kenneth D. Poss 


Unlike mammals, zebrafish efficiently regenerate functional nervous system tissue 
after major spinal cord injury. Whereas glial scarring presents a roadblock for 
mammalian spinal cord repair, glial cells in zebrafish form a bridge across severed 
spinal cord tissue and facilitate regeneration. We performed a genome-wide profiling 
screen for secreted factors that are up-regulated during zebrafish spinal cord 
regeneration. We found that connective tissue growth factor a (ctgfa) is induced in 
and around glial cells that participate in initial bridging events. Mutations in ctgfa 
disrupted spinal cord repair, and transgenic ctgfa overexpression or local delivery of 
human CTGF recombinant protein accelerated bridging and functional regeneration. 
Our study reveals that CTGF is necessary and sufficient to stimulate glial bridging 
and natural spinal cord regeneration. 


The spinal cord of adult zebrafish recovers 
spontaneously after injury (Fig. 1A and fig. 
S1, A and B). Efficient axon growth, adult 
neurogenesis, and absence of glial scarring 
distinguish this injury response from that 
in mammals (1, 2). After initial inflammation, 
ependymal cells proliferate and glia form a bridge 
that is thought to provide a scaffold for axonal 
growth (3). The severed cord reconnects, and new 
neuronal connections lead to functional recovery 
(Fig. 1A) (4). Here, we analyzed extracellular fac- 
tors up-regulated in the regenerating zebrafish 
spinal cord. 

To identify potential secreted, pro-regenerative 
signaling molecules, we screened zebrafish tran- 
scriptomes for genes induced after spinal cord 
injury (Fig. 1B and table S1). Our screen identified 
seven genes encoding secreted extracellular 
proteins, including fibronectin 1a (fn1a), 
previously implicated in axon growth promotion 
(5, 6). Transcripts for fn1a, fn1b, connective tissue 
growth factor a (ctgfa), myostatin b (mstnb), 
and stanniocalcin 1 like (stc1l) increased in ependymal 
cells at 1 and 2 weeks post-injury (wpi) 
(Fig. 1C and fig. S1C). Whereas fn1a was expressed 
around the entirety of the central canal 
at 2 wpi, mstnb and ctgfa were predominantly 
expressed in the dorsal and ventral ependyma, 
respectively (Fig. 1C). 

CTGF is a matricellular, multifunctional pro- 
tein that can influence the activity of multiple 
major signaling pathways, affecting cell adhe- 
sion, migration, proliferation, and differentiation. 
CTGF expression increases after central nervous 
system (CNS) trauma in rodents, but its function 
after spinal cord injury has not been elucidated 
(7-9). Because ctgfa is induced upon spinal cord 
injury in zebrafish, we hypothesized that it may 
have pro-regenerative roles. We analyzed ctgfa 
expression along the rostrocaudal spinal cord 
axis (Fig. 1D). At 1 wpi, ctgfa transcription was 
induced in multiple cell types across the lesion 
site and in ependymal cells at the central canal 
near the lesion. At this time point, we detected 
strongest ctgfa expression in the ventral ependyma. 
By 2 wpi, ctgfa expression localized to 
ventral ependymal cells and marked the cellular 
bridge that had formed at the lesion site. Expression 
declined beyond 3 wpi (fig. S2A). Thus, 
ctgfa expression correlates with formation of 
the glial bridge. 

To identify the cell populations that express 
ctgfa during spinal cord repair, we generated 
transgenic reporter zebrafish with a 5.5-kb genomic 
sequence upstream of the ctgfa translational 
start site fused to an enhanced green fluorescent 
protein (EGFP) reporter cassette. ctgfa:EGFP flu- 
orescence resembled endogenous ctgfa mRNA 
expression in spinal cord tissue at 2 wpi (fig. S2, 
B and C). As early as 5 days post-injury (dpi), 
domains of ctgfa:EGFP and glial fibrillary acidic 
protein (GFAP), a marker of multiple glial cells 
in the CNS, overlapped within a subpopulation 
of glial cells at the injury site (Fig. 1E). We inter- 
pret these to be bridging cells. Similarly, ctgfa: 
EGFP colocalized with the GFAP+ bridge at 2 wpi 
(Fig. 1E). Comparison of ctgfa:EGFP and GFAP 
expression on serial cross sections revealed that 
~97% of GFAP+ bridging glia at the lesion core 
were also EGFP+ at 1 wpi (fig. S3). Further away 
from the lesion, ctgfa:EGFP was mainly present 
in ventral ependymal cells (fig. S3). We also de- 
tected ctgfa:EGFP in skeletal muscle, bone cells, 
and reactive fibroblast-like cells around the le- 
sion site. ctgfa-driven expression suggested de- 
lineation of “pioneer” bridging glia. 


To determine whether ctgfa is required for 
spinal cord regeneration, we generated a ctgfa 
mutant allele (ctgfa bns50; referred to as ctgfa−) 
that harbors a frameshift-causing 7-nucleotide 
deletion within the third exon of the ctgfa locus 
(fig. S4, A to C). ctgfa−/− animals are adult viable 
and appear to have unaffected motor function 
capacity (10) (fig. S4D). However, ctgfa−/− animals 
showed diminished swim capacity after 
spinal cord injury, with no significant functional 
recovery between 2 and 6 wpi (Fig. 2A). Heterozygous 
(ctgfa+/−) animals showed partial recovery 
of swim capacity by 6 wpi (Fig. 2A). At 4 wpi, 
anterograde axon tracing indicated that axon 
regeneration across the lesion site was reduced by 
~25% in ctgfa+/− and ~60% in ctgfa−/− spinal cords 
proximal to the lesion, and by ~40% in ctgfa+/− 
and ~80% in ctgfa−/− cords distally (Fig. 2B). Thus, 
ctgfa is required for spinal cord regeneration. 

Deficits in glial bridging might underlie the 
defects in spinal cord regeneration displayed 
by ctgfa mutants. Using GFAP and acetylated 
α-tubulin immunohistochemistry, we observed 
robust glial bridges in wild-type and ctgfa+/− animals 
at 4 wpi (Fig. 2C and fig. S5A). However, 
ctgfa−/− animals displayed ~71% less bridging than 
did wild-type clutchmates (Fig. 2C and fig. S5A). 
At 2 wpi, glial cells within ctgfa−/− injury sites 
often failed to extend projections into the lesion 
(fig. S5B). 5-ethynyl-2′-deoxyuridine (EdU) incorporation 
assays showed that ctgfa−/− zebrafish 
displayed a ~48% reduction in glial cell proliferation 
at 1 and 2 wpi (Fig. 2D and fig. S6). 
EdU+GFAP− cell numbers were comparable in 
wild-type, ctgfa+/−, and ctgfa−/− tissues at 1 wpi 
(Fig. 2D), which suggests that the effects of 
ctgfa mutations were preferential to glial cells. 
Thus, ctgfa is required for the changes in pro- 
liferation and morphology of glial cells during 
spinal cord regeneration. 

To examine the effects of excess ctgfa on spi- 
nal cord regeneration, we generated and injured 
transgenic fish that express full-length ctgfa un- 
der control of a heat-inducible promoter (hsp70: 
ctgfa-FL) (fig. S7). Recovery of swim capacity was 
improved in ctgfa-overexpressing animals given 
daily heat shocks and assessed at 2, 4, and 6 wpi 
(Fig. 3A). Swim capacity was comparable between 
sham-injured hsp70:ctgfa-FL and wild-type clutch- 
mates at 2 and 6 weeks after heat shock (Fig. 3A), 
indicating contextual effects of ctgfa overexpres- 
sion. Histology indicated increased bridging and 
axon regeneration in ctgfa-overexpressing fish 
relative to controls at 2 and 4 wpi, respectively 
(Fig. 3, B and C, and fig. S8A). Thus, whole-animal 
ctgfa overexpression promotes regeneration after 
spinal cord injury. 

CTGF harbors four protein interaction domains 
and a protease domain that self-cleaves CTGF 
into profibrotic N-terminal and proliferative 
C-terminal peptides (11, 12). To determine the 
active portion of zebrafish Ctgfa during spinal 
cord regeneration, we created transgenic fish with 
heat-inducible expression of either N-terminal 
or C-terminal ctgfa fragments (hsp70:ctgfa-NT 
and hsp70:ctgfa-CT) (fig. S7A). Only ctgfa-CT over- 
expression recapitulated the pro-regenerative 
Fig. 1. Identification of ctgfa from a screen for regulators of spinal cord 
regeneration. (A) Schematic of the multistep process of spinal cord regeneration 
in zebrafish. (B) A screen for secreted factors expressed during spinal cord 
regeneration (fpkm, fragments per kilobase of transcript per million; GO, Gene 
Ontology). (C) In situ hybridization on spinal cord cross sections at 1 and 
2 wpi and in uninjured control tissue. Sections proximal to the lesion from the 
rostral side are shown; dashed lines delineate the central canals. The canal 
dilates after injury. (D) ctgfa in situ hybridization on longitudinal spinal cord 
sections at 1 and 2 wpi and in uninjured control tissue. (E) ctgfa:EGFP reporter 
expression and GFAP immunohistochemistry during early bridging events at 
5 dpi (top) and after bridge formation at 2 wpi (bottom). The middle panel 
shows a high-magnification view of the boxed area in the top panel. In (D) and 
(E), dashed lines delineate spinal cord edges, arrows point to sites of bridging, 
and arrowheads point to ventral ependymal cells. Scale bars, 50 µm. 


effects of ctgfa-FL (Fig. 3, D to F). By 6 wpi, swim 
capacity was markedly increased in ctgfa-CT- 
overexpressing animals relative to ctgfa-NT- 
overexpressing or wild-type clutchmates (Fig. 3D). 
Anatomically, ctgfa-CT overexpression resulted 
in a factor of >3 increase in glial bridging at 
2 wpi and a factor of >2 increase in axon growth 
at 4 wpi (Fig. 3, E and F, and fig. S8B). These 
experiments indicate that the pro-regenerative 
activity of Ctgfa maps to its C-terminal domains. 

To examine whether the effects of Ctgfa aug- 
mentation could be reproduced by localized de- 
livery into the spinal cord lesion site, we injured 
wild-type animals and applied human recombi- 


nant CTGF (HR-CTGF-FL and HR-CTGF-CT) pep- 
tides adjacent to the lesion site at 5 and 10 dpi 
using a gelfoam sponge. We then assessed re- 
generation at 2 and 4 wpi, corresponding to 9 
and 23 days after initial treatment. Human CTGF 
and zebrafish Ctgfa are 81% identical and 87% 
similar at the amino acid level within the four 
Fig. 2. ctgfa is necessary for glial bridging and spinal cord regeneration. 
(A) Swim assays assessed animals’ capacity to swim against increasing water 
current inside an enclosed swim tunnel. Seven wild-type (ctgfa+/+), 10 ctgfa 
heterozygous (ctgfa+/−), and 10 mutant (ctgfa−/−) clutchmates were assayed 
at 2, 4, and 6 wpi. Statistical analyses of swim times are shown for ctgfa−/− 
(red) and ctgfa+/− (orange) relative to wild type. Recovery of ctgfa−/− animals 
was not significant between 2 and 6 wpi. (B) Anterograde axon tracing in 
ctgfa mutant animals at 4 wpi. For quantification of axon growth at areas proximal 
(shown in images) and distal to the lesion core, 16 wild-type, 17 ctgfa+/−, 
and 20 ctgfa−/− zebrafish from two independent experiments were used. 
(C) GFAP immunohistochemistry in ctgfa mutant spinal cords at 4 wpi. Percent 
bridging was quantified for 10 wild-type, 9 ctgfa+/−, and 10 ctgfa−/− 
clutchmates from three independent experiments. Dashed lines delineate 
glial GFAP staining; arrows point to sites of bridging. (D) Glial cell proliferation 
in wild-type, ctgfa+/−, and ctgfa−/− spinal cords at 1 wpi. Arrowheads 
indicate EdU-positive gfap:GFP-positive cells. For quantification of glial 
proliferation indices (left) and number of EdU-positive gfap:GFP-negative 
cells (right), 10 wild-type, 12 ctgfa+/−, and 15 ctgfa−/− animals from two 
independent experiments were used. *P < 0.05, **P < 0.01, ***P < 0.001; ns, 
not significant. Scale bars, 100 μm. 


protein interaction domains (fig. S9). Treatment 
with either HR-CTGF-FL or HR-CTGF-CT enhanced 
zebrafish spinal cord regeneration (Fig. 3, G to I, 
and fig. S8C). Swim capacity was improved in 
HR-CTGF-FL-treated or HR-CTGF-CT-treated 
animals by 2 wpi (Fig. 3G), near that of uninjured 
animals. As early as 1 wpi, we observed increased 
GFAP expression and a factor of >10 enhancement 
of glial bridging in HR-CTGF-CT-treated animals 
relative to vehicle-treated controls (fig. S8D). 
Bridging remained greater at 2 wpi by a factor 
of ~3 (Fig. 3H). At 4 wpi, axon regeneration was 
increased by a factor of ~1.5 to 3 in HR-CTGF- 
FL-treated or HR-CTGF-CT-treated animals relative 
to vehicle controls (Fig. 3I). Application of 
exogenous CTGF protein at the lesion site res- 


cued functional and anatomical spinal cord re- 
generation in ctgfa mutants, indicating specificity 
of both the phenotype and treatment (fig. S10). 
Thus, human CTGF protein, ostensibly via its 
C-terminal domains, enhances spinal cord regen- 
eration in zebrafish. 

Glial cell responses are thought to dictate the 
outcomes of spinal cord injury across species. We 
Fig. 3. ctgfa promotes glial bridging and spinal cord regeneration. (A) Swim 
assays determined motor function recovery of 10 hsp70:ctgfa-FL (green) and 
10 wild-type (gray) clutchmates at 2, 4, and 6 wpi. For sham controls, 8 ctgfa- 
FL-overexpressing (dashed green) and 7 wild-type (dashed gray) zebrafish 
were analyzed. Statistical analyses of swim times are shown for injured 
ctgfa-FL (green) relative to wild type. (B) GFAP immunohistochemistry was 
used to quantify glial bridging at 2 wpi in 18 ctgfa-FL-overexpressing and 16 
wild-type zebrafish from three independent experiments. (C) Anterograde 
axon tracing at 4 wpi after ctgfa-FL overexpression. Quantification at areas proximal 
(shown in images) and distal to the lesion core represents 12 ctgfa-FL- 
overexpressing and 10 wild-type zebrafish from two independent experiments. 
(D) Swim assays for 8 ctgfa-CT-overexpressing (blue), 10 ctgfa-NT-overexpressing 
(violet), and 9 wild-type clutchmate animals (wild-type controls 
for CT in dashed blue and for NT in dashed violet). Statistical analyses of 
swim times are shown for ctgfa-CT (blue) relative to wild type. (E) Glial 
bridging at 2 wpi in 19 ctgfa-CT-overexpressing and 20 wild-type animals 
from two independent experiments. (F) Anterograde axon tracing at 4 wpi 
after ctgfa-CT overexpression. Quantification represents 16 ctgfa-CT-overexpressing 
and 16 wild-type animals from two independent experiments. 
(G) Swim capacity was assessed for 9 vehicle-treated (gray), 8 HR-CTGF- 
FL-treated (green), and 9 HR-CTGF-CT-treated (blue) animals. Statistical 
analyses are shown for HR-CTGF-FL (green) and HR-CTGF-CT (blue) treatments 
relative to vehicle controls. (H) Glial bridging at 2 wpi in 18 HR-CTGF- 
CT-treated and 15 vehicle-treated animals from three independent experiments. 
(I) Anterograde axon tracing at 4 wpi after HR-CTGF-CT treatment. Quantification 
represents 18 vehicle-treated, 16 HR-CTGF-FL-treated, and 14 HR- 
CTGF-CT-treated animals from two independent experiments. For histology 
in (B), (E), and (H), dashed lines delineate glial GFAP staining and arrows 
point to sites of bridging. *P < 0.05, **P < 0.01, ***P < 0.001. Scale bars, 
100 μm. 


found that Ctgfa promotes zebrafish spinal cord 
regeneration, at least in part by facilitating the 
proliferation and bridging activity of pioneer glial 
cells that also express ctgfa. Other cell types also 
induce ctgfa in the injured zebrafish spinal cord 
and might contribute additional functions dur- 
ing regeneration. In mammals, reactive gliosis 
causes scarring and inhibits regeneration (13, 14), 


although evidence indicates that the lesion 
contains a heterogeneous pool of glial cells that 
release both axon growth-inhibiting and axon 
growth-permissive factors (6, 15-19). We suggest 
that identifying the mammalian glial subtype 
that can produce CTGF, or that is competent to 
respond to it, could reveal a pro-regenerative mam- 
malian counterpart to the zebrafish bridging glia. 
A selective insecticidal protein 
from Pseudomonas for controlling 
corn rootworms 

Ute Schellenberger, Jarred Oral, Barbara A. Rosen, Jun-Zhi Wei, Genhai Zhu, 
Weiping Xie, Mark J. McDonald, David C. Cerf, Scott H. Diehn, Virginia C. Crane, 
The coleopteran insect western corn rootworm (WCR) (Diabrotica virgifera virgifera LeConte) 
is a devastating crop pest in North America and Europe. Although crop plants that produce 
Bacillus thuringiensis (Bt) proteins can limit insect infestation, some insect populations have 
evolved resistance to Bt proteins. Here we describe an insecticidal protein, designated 
IPD072Aa, that is isolated from Pseudomonas chlororaphis. Transgenic corn plants expressing 
IPD072Aa show protection from WCR insect injury under field conditions. IPD072Aa leaves 
several lepidopteran and hemipteran insect species unaffected but is effective in killing WCR 
larvae that are resistant to Bt proteins produced by currently available transgenic corn. 
IPD072Aa can be used to protect corn crops against WCRs. 


The coleopteran insect western corn rootworm 
(WCR) (Diabrotica virgifera virgifera LeConte) 
and related species cause extensive damage 
to corn (Zea mays L.) in North America and 
Europe, resulting in crop yield losses and 
management costs (1-3). Farmer adoption of corn 
plants that produce insecticidal proteins for con- 
trolling major corn pests, including WCRs, has 
increased productivity in areas with insect pres- 
sure (4). Commercial transgenic crops introduced 
to date for insect control have been developed 
with genes from Bacillus thuringiensis (Bt) (5, 6). 
These genes encode insect-specific crystal proteins 
(Cry) or vegetative insecticidal proteins (Vip). 
Four of them (mCry3A, Cry3Bb1, eCry3.1Ab, and 
the binary protein complex Cry34Ab1/Cry35Ab1) 
have been commercialized for WCR control in 
the United States (5, 7). Emergence of WCR resistance 
to mCry3A, Cry3Bb1, and eCry3.1Ab proteins 
(8-10) under field conditions demonstrates 
the need for broader, more durable strategies to 
control insect infestations and limit the develop- 
ment of resistance. The genetic diversity of Bt has 
already been explored in a search for additional 
insecticidal proteins (5). An alternative approach 
involves disabling WCR genes by RNA interfer- 
ence through the expression of double-stranded 
RNA (11). We hypothesized that, like Bt, other 
soil bacteria that live with soil-dwelling coleop- 
teran larvae may produce proteins that kill corn 
rootworms. Here we describe a potent and sel- 
ective insecticidal protein from a Gram-negative 
bacterium, Pseudomonas chlororaphis, which 
we identified through screening soil-isolated 
microbial strains. This protein protects trans- 
genic corn plants from WCR injury under field 
conditions. 

We collected soil samples from the U.S. states 
of Iowa, Illinois, and Wisconsin, and isolated 


microbial strains under growth conditions to 
favor species other than Bt. Crude protein extracts 
from these isolates were screened for activity 
against WCRs in artificial diet bioassays. One 
isolate—identified as P. chlororaphis by 16S ribo- 
somal DNA sequence analysis (12)—showed a po- 
tent inhibitory effect on WCRs. We used multistep 
chromatography to isolate the insecticidal protein 
and achieved high-purity (>80%) enrichment in 
the final step (fig. S1). The major protein band 
with an apparent molecular weight of ~10 kDa 
was digested with trypsin and subjected to iden- 
tification by liquid chromatography-tandem mass 
spectrometry (LC-MS/MS). We used the LC-MS/ 
MS data of tryptic peptides to search against a 
protein database including protein sequences 
translated from the in-house genome sequence 
of this specific P. chlororaphis isolate. A high- 
confidence match to a hypothetical protein en- 
coded by a gene from our specific P. chlororaphis 
isolate was obtained. This gene encodes a small 
protein of 86 amino acids, which we designated 
IPD072Aa (GenBank accession number KT795291). 
IPD072Aa was cloned and expressed in Escherichia 
coli to confirm that it was responsible for the 
insecticidal activity. Affinity-purified recombinant 
IPD072Aa was highly potent against WCR larvae 
and also active against northern corn rootworm 
(NCR) (Diabrotica barberi) larvae in diet bioassays 
(Fig. 1). 

To evaluate the activity spectrum, we tested 
purified recombinant IPD072Aa protein at concentrations 
up to 875 μg/ml in diet assays against 
several lepidopteran and hemipteran species, 
including black cutworm (Agrotis ipsilon Hufnagel), 
corn earworm (Helicoverpa zea Boddie), European 
corn borer (Ostrinia nubilalis Hübner), fall 
armyworm (Spodoptera frugiperda J. E. Smith), 
soybean looper (Pseudoplusia includens Walker), 
and western tarnished plant bug (Lygus hesperus 
Knight), and observed no mortality or inhibitory 
effect on any of those insect species. We also used 
an Agrobacterium-mediated transient gene expres- 
sion system (13) to express IPD072Aa under control 
of a double-enhanced mirabilis mosaic virus 
viral promoter (14) in common bean (Phaseolus 
vulgaris). Sets of leaf disks transiently expressing 
IPD072Aa, Cry1F (GenBank accession number 
AAE8100) as a positive control, or no insecticidal 
protein (vector backbone as a negative control) 
were infested with two or three neonates from 
each of the same five lepidopteran species used 
in the diet assays. Leaf disks expressing Cry1F 
were fully protected from feeding damage ac- 
ross all of the lepidopteran species tested. In 
contrast, leaf disks expressing IPD072Aa were 
totally consumed and comparable to the negative 
control. Western blot analysis (fig. S2) indicated 
accumulation of IPD072Aa protein at 
3067 ± 594 ng/mg of total extracted protein. 
Thus, IPD072Aa has a limited pest spectrum that 
includes corn rootworms. 

We analyzed the primary sequence of IPD072Aa 
with protein prediction tools within InterProScan 
(comprising PROSITE, PRINTS, Pfam, ProDom, 
SMART, TIGRFAMs, and PIR SuperFamilies) (15). 
No known peptide motif, domain, or signature 
was identifiable. We used the alignment tool 
BLASTP (16) to identify proteins homologous 
to IPD072Aa in sequence databases and found 
18 matches with primary sequence identity rang- 
ing from 38 to 83% (table S1 and Fig. 1). All ho- 
mologs identified from public databases were 
annotated as hypothetical or uncharacterized 
proteins. 

The majority of these proteins (12 of 18) were 
encoded in Pseudomonas spp. Three sequences 
were deposited from metagenomics analysis of 
microbial communities populating the switch 
grass rhizosphere (Joint Genome Institute: http:// 
jgi.doe.gov), and three more IPD072 homologs 
came from Burkholderia, Xenorhabdus, and Photorhabdus 
spp., respectively. Certain species of the 
latter two genera are known to produce high- 
molecular weight and multicomponent insec- 
ticidal protein complexes active against some 
of the lepidopteran species (17). Our findings 
suggest that additional proteins that can control 
rootworms might exist in these bacteria. To test 
this, a subset of IPD072 homologs was cloned, ex- 
pressed, affinity-purified, and evaluated against 
WCR in diet bioassays. Several of these homologs, 
including distantly related IPD072Aa protein fam- 
ily members, possess insecticidal activity (Fig. 1C). 

Transgenic corn plants were generated by 
Agrobacterium-mediated transformation (18) to 
assess IPD072Aa’s potential for controlling corn 
rootworm in planta. Transformation was performed 
using a plasmid vector (ZmIPD072Aa) contain- 
ing an expression cassette of an IPD072Aa gene 
under the control of a banana streak virus (BSV) 
promoter (see supplementary materials). Stable 
single-copy parental (T0) transgenic plants (called 
events) expressing IPD072Aa were assayed in the 
greenhouse for root protection against WCR lar- 
vae, using 400 WCR eggs per plant. Root damage 
was assessed using the Iowa State University 0-3 
Node-Injury Scale (19). Plants showing low node- 
injury scores (<0.10) were retested in the first 
generation (T1) and demonstrated excellent root 
protection compared with controls (Fig. 2). In 
addition, expression of IPD072Aa in root tissue 
from these plants was detected by Western blot 
analysis (fig. S3). 
Fig. 1. IPD072 protein family and efficacy. 
(A) Phylogenetic tree of 19 IPD072 protein 
sequences obtained by BLAST against the 
National Center for Biotechnology Information, 
DuPont Pioneer internal, and Joint Genome 
Institute metagenomic databases. We devised 
a naming convention to show the protein 
sequence identity relationship to IPD072Aa. 
For example, IPD072B, IPD072C, and IPD072D 
are, respectively, less than 90, 80, and 70% 
identical to IPD072Aa at the amino acid 
sequence level. For homologs within the same 
bracket of percent identity, a lowercase letter is 
added (e.g., IPD072Fa, IPD072Fb, IPD072Fc, 
etc.). Numbers in parentheses represent the percent identity compared to IPD072Aa. Black, IPD072 
homologs from Pseudomonas spp.; blue, proteins identified in metagenomes from microbial communities; 
red, proteins derived from Burkholderia (IPD072Fb), Xenorhabdus (IPD072Gc), and Photorhabdus 
(IPD072Gaq). Detailed information about each protein can be found in table S1. (B) IPD072Aa activity on 
WCRs (8-day assay) and NCRs (7-day assay). Light blue bars, concentration to inhibit 50% of the larvae 
(IC50) from survival (as dead) or normal growth (>60% reduction in size compared with control larvae); 
dark blue bars, concentration to kill 50 percent of the larvae (LC50). Shown are values with 95% fiducial 
limits (FLs) (error bars); n = 77 to 228 larvae, not for side-by-side direct comparison. (C) WCR insecticidal 
activity charted as the values of IC50 from a 6-day assay of several IPD072 proteins. Shown are values with 
95% FLs (error bars); n = 144 to 209 larvae. IPD072Ea and IPD072Ga were also tested, and no activity was 
observed at a concentration of 400 μg/ml. 
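
The naming convention described in (A) can be read as a lookup from percent identity to a family letter. The Python sketch below is illustrative only. The caption states the cutoffs for B, C, and D; treating A as 90% or higher, continuing in 10-point steps through G, and the helper name `identity_bracket` are all assumptions made here, not part of the published convention.

```python
# Lower bound of each identity bracket (percent identity to IPD072Aa) and its letter.
# B, C, and D follow the caption; A and E through G are extrapolated assumptions.
BRACKETS = [(90, "A"), (80, "B"), (70, "C"), (60, "D"), (50, "E"), (40, "F"), (30, "G")]


def identity_bracket(percent_identity: float) -> str:
    """Return the assumed family letter for a given percent identity."""
    if not 0 < percent_identity <= 100:
        raise ValueError("percent identity must be in (0, 100]")
    for lower_bound, letter in BRACKETS:
        if percent_identity >= lower_bound:
            return letter
    raise ValueError("identity falls below the brackets sketched here")


# The text reports homologs ranging from 38 to 83% identity.
lowest = identity_bracket(38)   # falls in the 30-40 bracket under these assumptions
highest = identity_bracket(83)  # falls in the 80-90 bracket
```

Under these assumptions the reported identity range of 38 to 83% spans brackets G through B, which is in line with the family names shown in the tree.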
Fig. 2. Root protection from WCR injury by transgenic expression of IPD072Aa in T1 corn plants 
grown under greenhouse conditions. Node-injury scores in five transgenic events expressing IPD072Aa 
under the control of the BSV promoter compared with nontransgenic corn (negative control) and the 
commercial corn line DAS-59122-7 expressing Cry34Ab1/35Ab1. The scatter plot shows mean values ± SE 
(error bars); n = 10 plants each. (Inset) Roots of a plant expressing IPD072Aa (right), compared with a 
negative control (left). 
For evaluation in the field, second-generation 
(T2) nonsegregating hybrid maize seeds, derived 
from five original T0 transgenic events of construct 
ZmIPD072Aa, were planted at five locations across 
the U.S. corn belt in 2014 (table S3). The field trials 
were conducted on land that contained late- 
Fig. 3. Cross-resistance evaluation of IPD072Aa against mCry3Aa and Cry34Ab1/Cry35Ab1. 
(A) mCry3A-susceptible (white bars) and -resistant (gray bars) WCR larvae are inhibited by diet incorporation 
of recombinant IPD072Aa protein. The graph shows LC50 values (in micrograms per milliliter) with 
95% FLs (error bars) and resistance ratios (RRs), defined as the ratio of LC50 values of resistant over 
susceptible colonies. An RR of 1.7 for IPD072Aa is unlikely to be biologically important. The asterisk indicates 
that no mortality is observed at the top concentration (647 μg/ml) for mCry3A on the resistant colony 
(LC50 > 647) and that the exact RR cannot be calculated but is much higher than 57. (B) Expression of 
IPD072Aa provides root protection from both Cry34Ab1/Cry35Ab1-susceptible (white bars) and -resistant 
(gray bars) WCR larvae. Two independent transgenic events expressing IPD072Aa were analyzed in 
Rootrainer plant-based assays. A corn plant expressing Cry34Ab1/Cry35Ab1 was used as a positive 
control; nontransgenic corn served as a negative control. Results are mean values with 95% confidence 
intervals (error bars). Bars topped with different letters differ significantly at the 0.05 significance level 
(Tukey-Kramer test). 


Table 1. Comparison of node-injury scores among treatments at all five locations in 2014. 
Estimated node-injury score: Injury from corn rootworm larval feeding was assessed with root 
scores on the Iowa State University 0-3 Node-Injury Scale (17). Within each location, estimated 
node-injury scores followed by different letters are significantly different (P < 0.05). Locations were 
assigned a corn rootworm pressure level based on estimated node-injury scores of negative controls 
(low < 0.75; moderate = 0.75 to 1.50; high > 1.50). 
planted conventional corn in the previous season 
to attract corn rootworm beetles for egg laying 
and to enhance natural infestations. In addition, 
plots at all locations were manually infested with 
900 WCR eggs per plant between growth stages 
V2 and V4. On the basis of node-injury scores 
obtained from the negative control plants (table 
S3), corn rootworm larval feeding pressure was 
classified as high at Janesville (Wisconsin), Johnston 
(Iowa), and Readlyn (Iowa); moderate at Rochelle 
(Illinois); and low at Volga (South Dakota). Gen- 
eral observations from all locations indicated that 
the predominant corn rootworm species was the 
WCR. At one location (Janesville, Wisconsin), 
however, the corn rootworm species composition 
was 44% WCR and 56% NCR, as indicated by 
beetle collections from emergence cages. Another 
location (Readlyn, Iowa) showed substantial root 
injury (mean ± SE node-injury score of 1.6 ± 0.15, 
n = 30 plants) to a commercial hybrid corn line 
(MON88017) expressing only Cry3Bb1, which was 
planted in a separate comparative trial and ex- 
posed to natural corn rootworm infestations. The 
commercial hybrid corn line DAS-59122-7 (20), 
which expresses Cry34Ab1/Cry35Ab1, showed 
very low injury at this location (Table 1). 
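
The pressure classification used for the field sites can be sketched as a small Python function. This is a minimal illustration rather than the authors' analysis code. The thresholds are taken from the Table 1 legend, reading "high" as a score above 1.50 (an assumption, since the printed legend is ambiguous on that bound). The function name `pressure_level` and the example scores are hypothetical and are not values from table S3.

```python
def pressure_level(control_score: float) -> str:
    """Classify rootworm pressure from the negative-control node-injury score (0-3 scale)."""
    if not 0.0 <= control_score <= 3.0:
        raise ValueError("node-injury scores lie on the 0-3 scale")
    if control_score < 0.75:
        return "low"
    if control_score <= 1.50:
        return "moderate"
    # Assumption: any score above 1.50 counts as high pressure.
    return "high"


# Hypothetical negative-control scores, for illustration only.
example_scores = {"site_a": 0.4, "site_b": 1.1, "site_c": 2.3}
levels = {site: pressure_level(score) for site, score in example_scores.items()}
```

Classifying sites by the injury seen in unprotected controls, rather than by egg counts, ties the pressure level to the same scale on which the transgenic events are later compared.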

At all five testing locations, plants expressing 
the IPD072Aa protein provided good root pro- 
tection from feeding injury by corn rootworm 
larvae (Table 1). Details on statistics for the fixed 
and random effects are summarized separately 
in table S4A and table S4B. The node-injury score 
for plants from construct ZmIPD072Aa was not 
significantly different from the root protection 
provided by the commercial hybrid corn line 
DAS-59122-7 (20) at all five locations (P > 0.05, 
Table 1). Both constructs provided good pro- 
tection compared with the negative control (P < 
0.05, Table 1). As reference, a node-injury score of 
1.0 under field conditions has been estimated to 
cause a 15 to 18% reduction in corn grain yield 
(21, 22). The presence of NCRs at Janesville did 
not negatively influence the efficacy of plants from 
construct ZmIPD072Aa, which suggests that the 
IPD072Aa protein may be effective against NCRs 
as well. Additionally, results at Readlyn indicated a 
low likelihood of cross-resistance between IPD072Aa 
and the Cry3Bb1 protein expressed by MON88017. 
Plants from construct ZmIPD072Aa and DAS- 
59122-7 each provided good root protection at 
this location. 

Our field data from the Readlyn location sug- 
gest that WCRs resistant to Cry3Bb1 were not 
cross-resistant to IPD072Aa. We used bioassays 
to determine whether laboratory-selected colo- 
nies of WCRs resistant to mCry3A or Cry34Ab1/ 
Cry35Ab1 (23, 24) were cross-resistant to IPD072Aa. 
mCry3A-resistant and -susceptible WCR larvae 
were evaluated using IPD072Aa protein in arti- 
ficial diet bioassays (Fig. 3A). The mCry3A protein 
was used as a positive control. The resistance ratio 
(RR), defined as the ratio of LC50 values (the concentration 
needed to kill 50% of the larvae) of resistant over 
susceptible colonies, was >57 for mCry3A but only 
1.7 for IPD072Aa. A small RR of 1.7 for IPD072Aa 
is unlikely to be biologically important. To further 
evaluate any potential of weak cross-resistance 
between this pair of insect actives, we conducted 
an additional probit analysis to assess the main 
effect of the WCR population and the interaction 
between bioassay dose and WCR population type. 
The results support no cross-resistance between 
mCry3A and IPD072Aa (P > 0.05, table S5). We 
then tested the effects of IPD072Aa on Cry34Ab1/ 
Cry35Ab1-resistant and -susceptible WCR larvae 
in a greenhouse Rootrainer plant-based assay (23). 
Compared with Cry34Ab1/Cry35Ab1 plants, plants 
from the ZmIPD072Aa construct expressing 
IPD072Aa provided good root protection from 
feeding injury by Cry34Ab1/Cry35Ab1-resistant 
WCR larvae (Fig. 3B). No significant difference 
was found in the node-injury scores caused by 
Cry34Ab1/Cry35Ab1-resistant and -susceptible WCR 
larvae on the ZmIPD072Aa plants. Results from 
the Rootrainer assay suggest no cross-resistance 
between Cry34Ab1/Cry35Ab1 and IPD072Aa. Thus, 
IPD072Aa is effective in killing WCR insects that 
are resistant to either of the two types of Bt 
proteins used in current commercial transgenic 
corn lines. 
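
The resistance ratio used above is a simple quotient, but the mCry3A case shows a wrinkle: when no mortality occurs at the highest dose tested, only a lower bound on the ratio can be reported. The Python sketch below illustrates both cases. It is not the authors' analysis, which relied on probit fits. The function name `resistance_ratio`, its parameters, and the example numbers are hypothetical.

```python
from typing import Optional, Tuple


def resistance_ratio(
    lc50_resistant: Optional[float],
    lc50_susceptible: float,
    top_dose: Optional[float] = None,
) -> Tuple[float, bool]:
    """Return (ratio, is_lower_bound) for LC50 of resistant over susceptible colonies.

    Pass lc50_resistant=None when no mortality was seen at the highest dose;
    the ratio is then computed from top_dose and flagged as a lower bound.
    """
    if lc50_susceptible <= 0:
        raise ValueError("susceptible LC50 must be positive")
    if lc50_resistant is None:
        if top_dose is None:
            raise ValueError("top_dose is required when the resistant LC50 is unknown")
        return top_dose / lc50_susceptible, True
    return lc50_resistant / lc50_susceptible, False


# Hypothetical LC50 values (same units for both colonies), for illustration only.
ratio, bounded = resistance_ratio(17.0, 10.0)
censored_ratio, censored = resistance_ratio(None, 10.0, top_dose=600.0)
```

Returning the lower-bound flag alongside the number keeps a censored estimate such as ">57" from being mistaken for a measured ratio.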

The discovery of the potent and selective WCR 
active protein IPD072Aa from an isolate of P. 
chlororaphis indicates that soil microbes, other 
than Bt, have potential as sources of insecticidal 
proteins. Though its insecticidal mechanism is 
under further investigation, IPD072Aa appears to 
have functional similarities to Bt-derived insecticidal 
proteins. IPD072Aa is orally active against 
WCR larvae, suggesting an effect through the in- 
sect digestive system. When expressed in transgenic 
plants, IPD072Aa shows protection of corn roots 
from WCR larval injury. The ability of IPD072Aa 
to kill WCR larvae resistant to mCry3A or Cry34Ab1/ 
Cry35Ab1 implies that its target site is different 
from each of these two Bt proteins. As resistance 
evolves among insect pests targeted by Bt corn 
seed products, IPD072Aa offers promise to effec- 
tively combat the challenges posed by continuing 
corn rootworm infestations. 
Unique opportunities for fellows at SciLifeLab 

“SciLifeLab is a beautiful workplace, both as a building and as an organization”, says Simon Elsässer (right). “Yes, I like the spirit 
and the freshness here, that it is all new and there is a lot of enthusiasm”, says Magda Bienko (left). 


SciLifeLab, Science for Life Laboratory, is a Swed- 
ish research center within molecular biosciences 
with a focus on health and environment. To further 
strengthen the research environment at SciLifeLab 
the center regularly recruits young, talented research 
leaders to become SciLifeLab fellows. 


Each fellow is recruited by one of the center’s host 
universities and receives funding from them. The first 
call for SciLifeLab Fellows was opened in 2013 and 17 
fellows have now been recruited, mainly internation- 
ally. Three of those also received ERC starting grants 
for 2016. And it seems that being part of the SciLifeLab 
fellows program played a key role in attracting that 
funding. 


“Thanks to the funding from the SciLifeLab fellows 
program, we have been able to perform preliminary 
experiments that enabled us to show other granting 
agencies that we have potential. It really made me an 
independent fellow from the start”, says Simon Elsässer, 
fellow at Karolinska Institutet/SciLifeLab. 


Sebastian Deindl, fellow at Uppsala University/SciLifeLab, agrees: 
“The SciLifeLab fellows program is really quite unique 
in the sense that it provided a complete package, including 
funding and access to advanced infrastructure, 
which gives stability to perform ambitious and chal- 
lenging science.” 


Multidisciplinary environment 

“It was very appealing for me that SciLifeLab brings 
multiple research areas from several universities to- 
gether in one building”, says Magda Bienko, fellow at 
Karolinska Institutet/SciLifeLab. “It was also comforting 
to know that I would start at the same time as other young 
group leaders. I liked the idea of being able to contribute to 
shaping the environment at the center”, says Magda Bienko. 


“We are all ambitious, young 
scientists with different back- 
grounds, and I am excited 
and humbled to be given 
the chance to do something 
meaningful”, says Sebastian 
Deindl. 


“The SciLifeLab fellows 
program is quite unique in 
the sense that it provides 
a complete package 
compared to most other 
places”, says Sebastian 
Deindl. 
The fellows were recruited from three different universities: 
Magda Bienko from Massachusetts Institute of 
Technology, Sebastian Deindl from Harvard University, 
and Simon Elsässer from University of Cambridge. 
They complement each other's research since they are 
all interested in chromatin and developing techniques 
to study it. Their interest spans from the architecture of 
the genome and its dynamic regulation by histone pro- 
teins to the mechanisms of chromatin-associated com- 
plexes studied at the single-molecule and atomic level. 


“We have invested the first few months at SciLifeLab 
to set up our respective methods; now there is a lot of 
room for collaboration and combining these methods”, 
says Magda Bienko. 


Access to infrastructure 

All researchers in Sweden have access to the SciLifeLab 
infrastructure. For example, Magda Bienko collabo- 
rates with the superresolution microscopy facility at 
the center. 


“Getting access to this facility was one of the very appealing 
aspects of Sweden for me since my research 
relies very much on this”, says Magda Bienko. “The expertise 
within the SciLifeLab facilities is really good, 
not only in terms of help with actual experiments but 
also with the theory behind. For example there is a bioinformatics 
drop-in support every week which we use 
and that is working really well for us.” 

“I have gotten so used to having access to the SciLifeLab 
infrastructure that I don't even think about it”, says 
Sebastian Deindl. “I am collaborating with the facility 
for cryo-electron microscopy as well as the ones for 
protein science and chemical biology. It would completely 
change the character of my research did I not 
have access to this type of infrastructure.” 

Looking forward to a bright future 

It actually seems that the three fellows have come to 
like SciLifeLab and Sweden so much that they plan to 
stay here. 

“I think that the future here will be very bright and I 
would like to stay in Sweden and in Stockholm, it is really 
good to live here”, says Simon Elsässer. “I hope we 
are going to make some great discoveries the following 
years, leading to exciting and unexpected findings.” 

Magda Bienko agrees: 
“I want to stay at SciLifeLab and I wish that it remains 
very dynamic and I am willing to contribute to that. 
Science is always the top priority and I can do the science 
I want here very well so this is where I see myself 
in the future.” 

“We are making some really exciting progress and it is 
more fun than ever in the lab,” says Sebastian Deindl. 
And the Winner Is... 


Eppendorf & Science Prize for Neurobiology 
Congratulations to Gilad Evrony on winning the 2016 
Eppendorf & Science Prize for his work on developing 
technologies to sequence and analyze the genomes of 
single cells from the human brain. Dr. Evrony’s research 
has revealed a diversity of mutations in neuronal 
genomes indicating that every neuron in the brain 
carries a unique fingerprint of somatic mutations. Such 
mutations can cause focal brain malformations and may 
have a role in other unsolved neurologic diseases. The 
technology also allows, for the first time, reconstruction 
of developmental lineage trees in the human brain to 
study how cells proliferate and migrate to build the brain. 
The annual US$25,000 Eppendorf & Science Prize for 
award. He will be presented with the Prize at a ceremony 
Mouse with a milkshake: 
Behavioral windows into 
brain function 


Neuroscientists are employing new techniques and technolo- 
gies to use animal behavior as a window into brain function. 
For instance, researchers are teaching mice to nose-poke a 
touchscreen by doling out strawberry milkshakes as part of 
cognitive testing. It’s a light-hearted reminder of the similarity 
between mice and humans, perhaps—but hopes are rising 
that new therapies developed using mouse models of Alz- 
heimer’s disease, for example, can be translated to humans. 
By Caitlin Smith 


For years, neuroscientists have gained new revelations
about how the brain works by separately studying
genes, neurons, or animal behavior. Yet they are surely
far from understanding the entirety of brain function.
Today, neuroscientists are making strides by developing new 
tools and combining existing techniques to yield fresh insights. 

One strategy is to use behavior as a specific entryway into
learning how the brain operates. A simple behavior, like the
patellar or knee-jerk reflex your doctor tests by tapping the
patellar tendon in your knee, indicates the healthy functioning
of a specific region of your central nervous system. But study- 
ing behaviors that involve multiple neural and motor pathways 
and different brain areas—such as eating and physical activ- 
ity—remains challenging. And researchers encounter even 
more obstacles when trying to analyze complex behaviors like 
social interactions. Neuroscientists are now harnessing new 
technologies and advanced computing capabilities that analyze 
both simple and complex behaviors to increase our knowledge 
of brain function. 


Body in motion, mind in motion 


As any neurologist can tell you, small movements speak 
volumes about what’s going on inside the brain. For example,
supination (turning the hand and forearm to the palm-up
position) can indicate the health of the corticospinal tract, a
band of nerve fibers in the brain that ferries information about 
limb movements to and from the spinal cord. 

Neurologists can detect brain or spinal cord damage—sus- 
tained during a stroke, for example—by evaluating a patient’s 
supination ability. “This is a classic neurological sign for corti- 
cospinal tract injury,” says Jason Carmel, director of the Motor 
Recovery Laboratory at the Burke Medical Research Insti- 
tute. Patients with damage to the corticospinal tract usually 
show arm drift when asked to hold their arms out palms-up, 
as if holding up a pizza box. But researchers can’t ask rats 
to pretend to hold a pizza box. So Carmel, also an attending 
neurologist, developed a supination task for rats to quantify the 
effectiveness of therapies — ultimately to find the best therapies 
to help his patients regain motor function. 

Carmel worked with scientists at Vulintus, whose forelimb 
assessment system MotoTrak was designed to replace the 
near-constant human supervision required for traditional motor 
assessments with precision sensors and computer monitoring 
for behavioral reinforcement. Pretraining animals for motor
assessment tasks traditionally requires one-on-one training, which
can take weeks or months and monopolize the lab’s time.
MotoTrak automates almost all of an animal’s training and 
testing. Now, says Carmel, “we can get studies done more 
quickly and use a larger cohort of animals.” 

MotoTrak uses an adaptive training protocol, automatically 
adjusting the task parameters required for rewards to keep the 
animal motivated. In MotoTrak’s supination module, “animals 
are trained to reach for a doorknob-like manipulandum [object] 
and turn the knob a specified number of degrees to trigger a 
[food pellet] reward,” says Andrew Sloan, director at Vulintus. 
Once rats are fluent in the supination task, MotoTrak automati- 
cally tests how well they supinate before and after different 
therapies for corticospinal tract injury. These therapies include 
standard neurorehabilitation exercises, and future work may in- 
corporate electrical stimulation of the brain and/or spinal cord. 
Carmel hopes to use this new device to develop therapies for 
corticospinal tract injuries that will benefit his patients. 
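
To make the idea of an adaptive training protocol concrete, here is a minimal sketch of one common way such a scheme can work: a simple staircase that raises the required turn angle when the animal is succeeding often and lowers it when the animal is struggling. This is not MotoTrak's actual algorithm, which the article does not describe; the class name `AdaptiveThreshold`, the window size, the step size, and the success-rate cutoffs are all illustrative assumptions.

```python
from collections import deque


class AdaptiveThreshold:
    """Hypothetical staircase for a knob-turning task (not the vendor's method)."""

    def __init__(self, start_deg=10.0, step_deg=5.0, min_deg=5.0,
                 max_deg=60.0, window=10, raise_rate=0.8, lower_rate=0.4):
        self.threshold_deg = start_deg   # turn angle currently required for a reward
        self.step_deg = step_deg         # how much to adjust the requirement
        self.min_deg = min_deg
        self.max_deg = max_deg
        self.raise_rate = raise_rate     # assumed cutoff: task is too easy
        self.lower_rate = lower_rate     # assumed cutoff: task is too hard
        self.history = deque(maxlen=window)

    def record_trial(self, peak_turn_deg):
        """Score one trial, then adapt the threshold once the window is full."""
        rewarded = peak_turn_deg >= self.threshold_deg
        self.history.append(rewarded)
        if len(self.history) == self.history.maxlen:
            rate = sum(self.history) / len(self.history)
            if rate >= self.raise_rate:
                self.threshold_deg = min(self.max_deg,
                                         self.threshold_deg + self.step_deg)
                self.history.clear()
            elif rate <= self.lower_rate:
                self.threshold_deg = max(self.min_deg,
                                         self.threshold_deg - self.step_deg)
                self.history.clear()
        return rewarded
```

In this sketch, each call to `record_trial` returns whether the trial would earn a reward, and the requirement drifts toward a level the animal can meet most, but not all, of the time.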


Home sweet home 

Rodents are the most common models for behavioral stud- 
ies, and the advent of knockout technology has made mice es- 
pecially interesting subjects. Traditionally, lab mice live in home 
cages—not their natural habitat, but as “normal” as possible 
for mice who live in a lab—with researchers removing them 
temporarily to test devices such as mazes or social recognition 
chambers. But increasingly, researchers are recognizing the 
advantages of testing animals in their home cages whenever 
possible, as it reduces stress on the animals. Testing mice in a 
home cage environment is especially important when measuring
behavioral parameters that might be more easily confounded
by the effects of human handling or changes in environment.
Several companies offer home cages that double as behavioral
testing chambers.

The PhenoTyper home cage from Noldus Information Tech- 
nology (NIT) was specifically developed for behavioral track- 
ing—a video camera and lights on top are optimized for video 
recording, eliminating shadows and illuminating the cage floor 
uniformly. “In the PhenoTyper, animals can move about, burrow 
in bedding, eat and drink, dig holes, and play; [it’s cozier] than
the older test cages, which are bare plastic and far from a natu- 
ral environment,” says Lucas Noldus, founder and managing 
director of NIT. 

PhenoTyper is currently used in many behavioral studies of
knockout mice. NIT is a partner in the Innovative Medicines 
Initiative project EU-AIMS (European Autism Interventions—A 
Multicentre Study for Developing New Medications), a consor- 
tium of academia and pharmaceutical companies searching for 
therapeutic drugs for autism spectrum disorder. NIT is provid- 
ing the PhenoTyper cage and software for preclinical studies at 
Utrecht University Medical Center, the Netherlands. Hoffmann-
La Roche, Pfizer, and other partners are testing the effects of
drugs on behaviors in mouse models for autism. 


Putting the “move” in movie 

In conjunction with the PhenoTyper cage, NIT offers EthoVi- 
sion XT automatic behavior recognition software, which moni- 
tors rodents in their normal environment around the clock by 
video tracking. Unlike other systems that track an animal as 
one center-of-mass point, EthoVision finds the animal’s contour 
and follows it as three points—nose point, center of mass, and 
tail base—connected by two vectors, and tracks many other 
body features extracted from the video image. 

Tracking an animal using points and vectors allows EthoVi- 
sion to derive much more information automatically. For ex- 
ample, the angle formed by the two vectors (which intersect 
at the center of mass) tells you whether the animal’s body is 
stretched out straight, curled up, or turned to the left or right. 
The direction of the front vector indicates the direction an 
animal is heading (if moving), and its field of view. If the back 
vector is stationary and the front vector is moving side-to- 
side, then the animal is moving its head back and forth, scan- 
ning its environment. 
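
The geometry described above can be sketched in a few lines. The following is an illustration of the general idea, not EthoVision's implementation: the function names and the (x, y) tuple format are assumptions. It computes the signed angle between the back vector (tail base to center of mass) and the front vector (center of mass to nose), plus the heading of the front vector.

```python
import math


def body_angle_deg(nose, center, tail_base):
    """Signed angle in degrees between the back and front vectors.

    0 means the body is stretched out straight. The sign tells left from
    right; which is which depends on the axis convention (in image
    coordinates, where y grows downward, the sign is flipped).
    """
    back_x, back_y = center[0] - tail_base[0], center[1] - tail_base[1]
    front_x, front_y = nose[0] - center[0], nose[1] - center[1]
    cross = back_x * front_y - back_y * front_x
    dot = back_x * front_x + back_y * front_y
    return math.degrees(math.atan2(cross, dot))


def heading_deg(nose, center):
    """Direction of the front vector, measured from the positive x axis."""
    return math.degrees(math.atan2(nose[1] - center[1], nose[0] - center[0]))
```

Comparing `heading_deg` across video frames while the back vector stays put would flag the side-to-side head scanning mentioned above; a body angle with a large magnitude suggests a curled-up or sharply turned posture.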

EthoVision, which automatically recognizes 10 different
behaviors, has been used to develop animal models for
Parkinson’s and Alzheimer’s diseases, and to screen for
therapeutic drugs by looking for behavioral changes in the
animal models, says Noldus. NIT also works with other
technology companies, such as Data Sciences International,
which manufactures wireless telemetry sensors. A small
microchip-like sensor injected into an animal can transmit
vital signs such as body temperature and heart rate to
EthoVision, which synchronizes the physiological vital signs
to the behavioral data.
resident animal that is defending its territory,” says Noldus,
“and whether the stress on the animal is suppressed by 
anxiolytic drugs.” 


Mind over food: Latest behavior tracking technology 

Researcher Catherine Kotz, a professor in the Department 
of Food Science and Nutrition at the University of Min- 
nesota and the Minneapolis VA Medical Center, along with 
collaborators Charles Billington and Jennifer Teske, is also
applying new behavior tracking technology in obesity research. 
The team applies multiple techniques in studying the role of the 
neuropeptide orexin in the brain’s regulation of eating behaviors 
and energy expenditure in rodents. 

Kotz uses Promethion system home cages from Sable Sys- 
tems, because they are identical to the animals’ home cages 
and do not require an adaptation period—which she emphasizes 
is especially important for energy balance measurements. “A 
characteristic sign of stress for these animals during acclima- 
tizing to a new environment is weight loss, so you can’t study 
weight loss,” says Kotz. “All the assumptions that go into mea- 
suring energy being used by metabolic rate and physical activity 
must be met, and this requires a stable energy balance.” And 
the Promethion system offers more sensitive measurements of 
food and water intake, body weight, physical activity, and energy 
expenditure than any system she has previously encountered. In 
addition, it allows for measurement of total energy expenditure 
instead of energy expended due to metabolic rate only. “This 
system enables us to get much better data,” she says. 

Orexin affects eating behavior as well as physical activity level, 
so Kotz wants to measure the energy expended during particular 
activities. “Other systems are OK for measuring basic metabolic 
rate, but if you want to measure the calories burned during a 
certain behavior, the other systems don’t work,” she says. The 
Promethion system lets her do this, because it can return syn- 
chronized behavioral and metabolic information in real time. It 
uses light beams in the X, Y, and Z planes so that beam breaks 
record the animal’s movement path inside the cage. In addition,
the cage contains a balance to measure body weight, a running
wheel, and sensors on the food and water dispensers. This
information can be paired with time-stamped energy expenditure
data to determine the calories burned during a certain behavior.
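
As a rough illustration of how time-stamped energy expenditure data might be paired with behavior records, the sketch below sums the energy spent during each labeled bout. The data layout is an assumption made for the example, not the Promethion export format: energy samples are (timestamp in seconds, kcal per hour) pairs taken at a fixed period, and bouts are (start, end, label) intervals.

```python
def calories_by_behavior(ee_samples, bouts, sample_period_s):
    """Total kcal spent per behavior label.

    ee_samples: list of (timestamp_s, kcal_per_hour), one per sample period.
    bouts: list of (start_s, end_s, label); a sample belongs to a bout when
           start_s <= timestamp_s < end_s. Samples outside every bout are ignored.
    """
    totals = {}
    for timestamp_s, kcal_per_hour in ee_samples:
        for start_s, end_s, label in bouts:
            if start_s <= timestamp_s < end_s:
                # Convert the rate to energy for this one sample period.
                kcal = kcal_per_hour * sample_period_s / 3600.0
                totals[label] = totals.get(label, 0.0) + kcal
                break
    return totals
```

For example, with one-second samples and bouts labeled "rest" and "wheel", the function returns a dictionary of kcal per label. A real analysis would also need to handle the lag between a behavior and its metabolic signature, which this sketch ignores.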

Orexin is produced by neurons in the hypothalamus that project
throughout the brain. To study its effects, Kotz is applying the
technique of optogenetics, using mice engineered to express
light-sensitive ion channels in orexin-producing neurons. When
light is pulsed onto the brains of these mice via a small cable
attached to the head, the neurons release this peptide. Kotz is
also applying the pharmacogenetic DREADD (designer receptor
exclusively activated by designer drugs) approach to define orexin action for
physical activity energy expenditure. Kotz’s lab is verifying the 
electrophysiology of the engineered mice in collaboration with 
University of Minnesota neuroscientist Mark Thomas, and it 
soon plans to study their behaviors in response to light stimula- 
tion using the Promethion system. 

Kotz is especially interested in the behavioral outcomes of 
the optogenetics and DREADD approaches because in previous
work, her team introduced orexin into the brain by injection. The
hypothalamic neurons that release orexin also release other sub- 
stances at the same time, possibly leading to different behaviors 
than would be attributable to orexin alone. With optogenetics 
and DREADD, “we'll be able to look at the physiology of the 
whole orexin neuron and everything it releases when stimu- 
lated—and the behaviors that follow,” she says. 


Of mice and men: Social behavioral monitoring 

Researchers from MRC Harwell, a Medical Research Coun- 
cil institute in the United Kingdom, are analyzing behaviors of 
knockout mice in conjunction with the International Mouse 
Phenotyping Consortium. Their goal is to look at the pleiotro- 
pic effects of knocking out every protein-coding gene in mice— 
a feat that involves characterizing over 20,000 knockouts with 
many hours of behavioral observation. “We are hoping that 
24-hour monitoring will allow us to see subtle changes to the 
behavior, feeding patterns, activity, and social interactions of 
many of our mutant lines,” says Sara Wells, director of the Mary 
Lyon Centre at MRC Harwell. 

MRC Harwell recently adopted the Actual Analytics system 
for behavioral studies of knockout mice, using the ActualHCA 
home cage environment that fits into their high-density cage 
racks. According to Patrick Nolan, neurobehavioral group 
leader at MRC Harwell’s Mammalian Genetics Unit, the sys- 
tem has two advantages: the ability to monitor behaviors in an 
undisturbed home cage environment, and the opportunity for 
group housing. Social animals by nature, mice are healthier and
less stressed when housed in groups than when housed alone,
and group housing also allows researchers to study social interactions.

Automatic behavioral tracking of more than one animal has 
always been tricky, because the monitoring system cannot 
always distinguish animals. Actual Analytics makes this pos- 
sible using radio frequency identification (RFID) technology, in 
which a microchip is placed just under the skin of each animal 
(just like microchipping your pets at the vet). Underneath the
ActualHCA cage is an RFID reader that sends information to a
computer to help track the identity of each animal. An array of 
detectors and high-definition video also tracks the animals’ be- 
haviors. “Our analytical tools measure a range of spontaneous 
behaviors such as locomotion, circadian rhythms, climbing, and 
interactions between animals,” says Douglas Armstrong, chief 
science officer of Actual Analytics. “These are under constant 
development, with new behaviors regularly being added.” 
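
One way to picture how RFID reads can resolve identities in video is the sketch below. It is an illustration under stated assumptions, not the Actual Analytics method: it supposes that the reader reports which antenna in a grid under the cage detected each tag, that antenna positions are known in the same coordinates as the video, and that the video tracker supplies anonymous animal centroids. The function name and data layout are hypothetical.

```python
import math


def assign_identities(rfid_reads, antenna_xy, centroids):
    """Greedily match each RFID tag to the nearest video-tracked centroid.

    rfid_reads: {tag_id: antenna_index} for the current time step.
    antenna_xy: {antenna_index: (x, y)} position of each antenna.
    centroids:  list of (x, y) animal positions from video, identity unknown.
    Returns {tag_id: index into centroids}.
    """
    candidates = []
    for tag_id, antenna_index in rfid_reads.items():
        ax, ay = antenna_xy[antenna_index]
        for i, (cx, cy) in enumerate(centroids):
            candidates.append((math.hypot(cx - ax, cy - ay), tag_id, i))
    candidates.sort()  # closest tag/centroid pairs first

    assigned, used = {}, set()
    for _distance, tag_id, i in candidates:
        if tag_id not in assigned and i not in used:
            assigned[tag_id] = i
            used.add(i)
    return assigned
```

Greedy matching is the simplest choice and can mislabel animals that are huddled over the same antenna; a production system would likely carry identities forward between frames and use the RFID reads to correct them.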

For Nolan’s group, which studies the genetic basis of neu- 
ropsychiatric disorders, the ActualHCA (home cage analysis) 
system lets them record social behaviors more accurately and
test whether mixing genotypes in a single cage influences
behaviors. He hopes that studying mouse mutants will yield 
insight into behaviors that might “provide benefits not only for 
our specific research domains, but also for neurological and 
neurodegenerative disease research in general.” 


Touchscreen technology taps into brain behavior 

At the forefront of cognitive testing in mice is touchscreen 
technology, which is generating excitement because such tes- 
ting is easily translated from mouse models to humans. Mice 
are trained in a Bussey-Saksida touchscreen chamber, named 
for its developers Tim Bussey and Lisa Saksida, both research- 
ers at Cambridge University in the United Kingdom. Inside the 
chamber is a touch-sensitive video screen that displays im- 
ages. Mice learn that when they touch the correct image(s) on 
the screen with their noses (so-called “nose-pokes”), they are
rewarded. Each correct nose-poke earns them something tasty, 
like a droplet of strawberry milkshake. Several companies offer 
Bussey-Saksida chambers, including Campden Instruments, 
Lafayette Neuroscience, Med Associates, and Stoelting. 

At Charles River Discovery (part of Charles River 
Laboratories), head of translational biology Maksym Kopanitsa 
is using these touchscreen chambers to study cognition in 
mouse models of brain disorders such as Alzheimer’s and 
Huntington’s diseases. Kopanitsa’s group wants to know 
which cognitive parameters change the most in mouse 
disease models, so that Charles River’s clients can then use 
those parameters to test novel drug treatments, perhaps
finding a drug that restores the altered parameter to that
of normal mice.

The touchscreen tests used with mouse models of Alz- 
heimer’s disease resemble those used in human Alzheimer’s 
patients, making translation easier. “Touchscreen technology 
allows one to evaluate mouse models in tasks that are directly 
relevant to human learning and memory, and to test how well 
drugs improve performance on those tasks,” says Kopanitsa. 
He hopes to eventually perform the touchscreen experiments 
simultaneously with electrophysiological measurements of brain 
activity, in order to observe neuronal circuits engaged during 
cognition. “Having these measurements will essentially close 
the circle by providing evidence that a mouse model reproduc- 
es a human brain disorder behaviorally and physiologically.” 

Armed with touchscreen tests and other techniques, today’s 
neuroscientists are opening new behavioral windows into how 
the brain functions. As researchers come to better understand 
the link between function and behavior, discoveries may be 
translated into better therapeutics for some of our most devas- 
tating medical disorders. 


Caitlin Smith is a freelance science writer based in Portland, Oregon. 
DOI: 10.1126/science.opms.p1600109 
CRISPR Knockout Kit

OriGene offers genome-wide gene 
knockout/knockin kits via CRISPR 
(human and mouse). A kit is available 
against every human gene locus, 
containing two guide RNAs (gRNAs) 
and a compatible donor vector with a 
functional cassette. In addition to gene 
knockout, these kits can also be used 
for promoter strength studies for each
gene locus. The predesigned nature of 
the kit greatly reduces the researcher’s 
time and effort. CRISPR/Cas9-based 
genome editing has become a popular 
tool for targeted genome manipulation 
because of its simplicity and high 
cutting efficiency. This system requires 
a functional Cas9 protein and a gRNA 
for an effective double-strand break
at a desired site. OriGene has 
developed the pCas-Guide system, a 
dual-function vector with both gRNA 
and Cas9 expression. OriGene has 
also designed a set of donor cassettes 
for construction of donor vectors. 
These cassettes include Luciferase- 
Loxp-Puro-Loxp, tGFP-Loxp-Puro- 
Loxp, and tRFP-Loxp-BSD-Loxp. 
OriGene Technologies 

For info: 888-267-4436 
www.origene.com 


Small Animal Physiological 
Monitoring System 

The Harvard Apparatus Small Animal 
Physiological Monitoring System inte- 
grates monitoring of rectal tempera- 
ture, electrocardiogram, respiration, 
oxygen saturation, blood pressure, 
and exhaled CO2. Designed to make
surgery and other manipulations on 
small animals easier, the system pro- 
vides superior monitoring of multiple 
physiological parameters on a single 
small platform. It features a real-time 
display of numeric data and user- 
defined graphic waveforms. An advanced
wireless interface and touchscreen
display allow the user to view,
add notes, and save and export data for easy analysis. The
system’s controlled heating platform maintains the animal’s
body temperature at the desired level. A stereotaxic adaptor
option is also available.
Harvard Apparatus 

For info: 800-272-2775 
www.harvardapparatus.com 


Neural Basal Medium 

PRIME-XV Neural Basal Medium is a 
chemically defined medium that sup- 
ports long-term cultures of human 
and rodent neuronal stem cells for 
neurological research. Neural progeni- 
tors are sensitive to culture conditions 
and difficult to obtain. Compared to 
media containing serum or undefined 
components, PRIME-XV’s serum-free 
formula is designed to increase ex- 
perimental reproducibility and culture 
consistency to maximize the use
of these cells. The ability to culture
primary and reprogrammed (induced 
pluripotent stem cell-derived) neural 
progenitors is offering neurological 
researchers better ways to screen 
drugs for toxicity, understand brain 
cancer, and use patient cells for dis- 
ease modeling to advance the treat- 
ment of neurological diseases such as 
Alzheimer’s, Parkinson’s, and epilepsy. 
PRIME-XV Neural Basal Medium
can be used with PRIME-XV IS21, a
serum-free supplement for neuronal 
culture that is prescreened for efficacy 
and avoids the risks that serum pre- 
sents to neuronal cultures. 

Irvine Scientific 

For info: 800-577-6097 
www.irvinesci.com 


iPSC-Derived Astrocyte Kits 

Axol Bioscience has expanded its 
range of neuroscience research tools 
to include an induced pluripotent 
stem cell (iPSC)-Derived Astrocyte 
Progenitor Kit and iPSC-Derived Ma- 
ture Astrocyte Kit. These kits include 
cryopreserved cells, optimized media, 
and supplements. The new kits offer 
researchers a complete culture sys- 
tem that is easy to use and promotes 
long-term cell viability for use in dis- 
ease modeling and drug discovery 
applications. Astrocytes are capable 
of lipid uptake and serve to support, 
repair, and maintain the central ner- 
vous system by providing essential 
nutrients that encourage neuronal 
maturation. Astrocyte dysfunction has 
been implicated in a number of neuro- 
logical conditions such as Alzheimer’s 
and Parkinson’s diseases, autism, 
amyotrophic lateral sclerosis, Rett 
syndrome, and schizophrenia. Axol 
iPSC-Derived Astrocyte Progenitor 
Kits are designed to generate mature 
astrocytes in under 3 weeks, and 
iPSC-Derived Mature Astrocytes ex- 
press prototypical phenotype markers 
such as glial fibrillary acidic protein. 
Axol Bioscience 

For info: 800-678-2965 
www.axolbio.com 


Laser Beam Combining Filters 

New Multiphoton LaserMUX beam 
combiners enable deeper tissue 
imaging and improved contrast in 
multicolor and multimodal fluores- 
cence microscopy. The filters set new 
performance standards by simultane- 
ously achieving high transmission, 
high reflection, and low group delay 
dispersion (GDD) over both reflection 
and transmission, while maintaining 
minimal wavefront distortion. These 
beam-combining filters allow users 
to achieve higher transmission and 
reflection levels over an expanded 


wavelength range, with wavefront performance optimized for 
deep tissue and in vivo imaging. Ideal for combining two fem- 
tosecond pulsed laser beams, they are perfect for optogenetics 


and other life science applications. 


IDEX Health & Science 


For info: +49-(0)-1801-808-800 


www.idex-hs.com 
The right Western choices can take the
“ugly” out of your next blot. 


Western blotting tools from MilliporeSigma help tailor your choices to your target: 
• Ultradurable, tear-resistant TruPAGE™ precast gels
• Complete selection of PVDF and nitrocellulose membranes to help you make the right choice
  for transferring your protein of interest
• 30-minute immunodetection with the SNAP i.d.® 2.0 system for cleaner, more consistent data
• Ready-to-use reagents like the ultrasensitive Luminata™ substrate
• Application-specific antibody manufacturing expertise, with over 70,000 tested in Western blot


Show us your ugliest blots. We'll provide tips and tricks for revealing beautiful Western data 
in our Protein Blotting Handbook, 6th edition. 


sigma-aldrich.com/westernblot 


P.S. If your blot has a big fingerprint in the middle... 
it might actually be you. 


The life science business of Merck KGaA, Darmstadt, Germany
operates as MilliporeSigma in the U.S. and Canada.


MilliporeSigma, Luminata, TruPAGE and the vibrant M are trademarks and Millipore and SNAP i.d. are registered trademarks 
The beauty of outreach


After about 20 years as an academic researcher in experimental condensed matter physics, I got
involved in outreach quite by chance. It happened a couple years ago, when some friends were 
writing a proposal for their company to run “edutainment” activities for the children in our town. 
Teaching at the university level had been an important part of my job for years, but I had never 
been involved in anything like what they were proposing. Nonetheless, it sounded like fun, so I 
suggested a partnership between their company and my university’s physics department. This is how I 
discovered the joy of outreach, which has ended up taking my career in an unexpected direction. 


When the proposal was funded, I 
offered to help set up and supervise 
the activities—simple experiments 
that explored basic concepts such 
as light and energy using balloons, 
plastic bottles, batteries, and other 
everyday materials. With no experi- 
ence in this kind of environment, I 
didn’t know quite what to expect. As 
I worked with the children, I was in- 
terested to discover that they gained 
as much from their interactions as 
from the content of the activities. 
Their excitement grew as they devel- 
oped a community by collaborating 
or, more often, engaging in friendly 
competition. I found that I greatly 
enjoyed facilitating this kind of fun 
and socializing through physics. 
After hearing about my work on 
these edutainment activities, the 
director of the physics department 
asked me to coordinate our departmental outreach pro- 
gram’s main project, which aims to enrich physics education 
for students in their final years of high school. I was grate- 
ful for this opportunity, which allowed me to make outreach 
part of my professional life. I have also found, unexpectedly, 
that it has been even more important on a personal level. It 
has helped me find a new level of community and connec- 
tion with students, which has reignited my own excitement 
about physics and my appreciation of the relationships that 
can emerge from sharing in the scientific inquiry process. 
Working with the graduate students involved in the proj- 
ect has shown me how collaboration can create a vibrant, 
creative community, which simultaneously makes the work 
more fun and leads to a better final result. When I started 
on the project, graduate students in the department merely 
helped conduct established educational activities, but soon, 
on their own initiative, many of them started to propose 
new activities related to their research. In turn, we started
to work more as a team. Now, our
meetings feature lively discussions 
about the status of the outreach 
project and possible directions for 
future development. Everyone con- 
tributes, leading to warm, meaning- 
ful personal interactions. Seeing the 
students thrive in this environment 
has made me realize that our out- 
reach program serves not merely to 
explain scientific concepts, but also 
to share the idea that participating 
in science is a positive, enriching 
personal experience. 

A few months ago, when the new 
director of the department asked 
me to take on the responsibility 
for the entire program, I accepted 
enthusiastically, effectively choos- 
ing outreach as my primary activity. 
I still plan to continue my research, 
because these activities are comple- 
mentary and I like the way they complete my work life, but 
I know that doing both outreach and research will be a chal- 
lenge. Nonetheless, I’m confident that it will be worthwhile, 
and I look forward to continuing to develop our outreach 
program. I want to engage the high school students as well 
as educate them so that they can experience the joy of shar- 
ing their scientific enthusiasm in a communal environment. 

It all started for fun, but in just a couple years, my work 
life has completely changed. Now, when school students ask 
for advice about pursuing a career in science, I tell them 
that the beauty of science is engagement and collaboration 
with a diverse group of individuals. Thanks to outreach, I 
rediscovered this beauty. 


Pierfrancesco Riccardi is a researcher at the University 
of Calabria in Italy. He thanks the students and postdocs 
involved in the Progetto Lauree Scientifiche. Send your 
career story to SciCareerEditor@aaas.org.